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FOREWORD 


In  conducting  the  research  described  in  this  report,  the 
Investigator(s)  adhered  to  the  "Guide  for  the  Care  and 
Use  of  Laboratory  Animals",  prepared  by  the  Committee 
on  Care  and  Use  of  Laboratory  Animals  of  the  Institute 
of  Laboratory  Animal  Resources,  National  Research  Council 
(DHEW  Publication  No.  (NIH)  86-23,  Revised  1985). 

The  i n ve s t i ga t o r ( s  )  have  abided  by  the  National  Institutes 
of  Health  Guidelines  for  Research  Involving  Recombinant 
DNA  Molecules  (April  1982)  and  the  Administrative  Prac¬ 
tices  Supplements. 


SUMMARY 


The  molecular  basis  of  Infectivity,  cy  topathoiKeni  clt  y  and 
genomic  activation  of  HIV-1  was  investigated.  Site  directed 
mutagenesis  and  restriction  endonuclease  cleavage  followed  by 
Bal  31  digestion  was  used  to  generate  deletion  mutations  in 
sor ,  3  ’orf  ,  the  5'  packaging  region  and  LTR.  The  biological 
activity  of  the  mutations  was  assessed  by  transfecting  the 
mutants  into  permissive  cells  and  examining  the  cells  for 
expression  of  viral  antigens,  reverse  transcriptase,  viral 
particles,  and  infectivity.  Analysis  of  these  mutants  to 
date  show  that  the  sor  gene  is  not  absolutely  required  for  HIV 
virion  formation  but  influences  viral  transmission  in  vitro 
and  is  crucial  for  the  generation  of  infectious  virus. 

Analysis  of  3  *  or f  activity  indicated  that  deletions  in  the 
amino  terminus  (which  also  results  in  truncation  of  the 
carboxyl  terminus  of  gp41)  result  in  significant  reduction  in 
viral  p r op aga t ion / i nf e c t 1 vl t y .  Studies  with  the  LTR  sequence 
have  delineated  an  enhancer  activity  on  Sp-1  binding  site  and 
the  tat  response  region  (TAR). 

Within  the  last  two  years  several  new  retroviral  isolates 
have  been  obtained  which  exhibit  a  high  degree  of  homology 
with  the  human  immunodeficiency  virus  type  ]  (HlV-1).  The 

discoverey  of  a  simian  virus  ( STLV- 1 1  I^qj,^  )  and  a  second  class 
of  human  retroviruses  (HIV-2)  which  are  highly  related  to 
each  other  and  distinctly  related  to  HIV-1  has  led  researchers 
to  hypothesize  that  these  viruses  all  diverged  from  a  common 
ancestor.  Furthermore,  these  new  viral  isolates  appear  to 
be  non-pathogenic .  In  order  to  determine  the  differences  that 
exist  between  these  viruses  and  HIV-1,  we  have  generated  lambda 
phage  clones  of  the  integrated  provirus  for  both  STLV-III^q]^ 
and  HIV-2fqm.  The  nucleotide  sequences  of  these  genomic  clones 
have  been  determined  and  compared  with  HIV-1  and  each  other. 

The  deduced  amino  acid  sequences  have  also  been  compared  with 
that  of  HIV-1.  Structurally,  these  viral  isolates  are  very 
similar  to  HIV-1  except  for  a  premature  stop  codon  in  the 
envelope  gene  and  an  extra  reading  frame  designated  X.  The 
homology  of  the  nucleic  acid  sequence  between  STLV-III  and 
HIV-1  is  55%  overall,  whereas,  STLV-III  and  HIV-2rod  exhibit 
an  overall  homology  of  75%.  This  sequence  data  indicates  that 
these  isolates  do  belong  to  the  same  group  of  retroviruses 
as  HIV-1.  A  comparison  of  the  deduced  amino  acid  sequence  of 
the  genes  encoded  for  in  the  provirus  confirms  a  conservation 
of  the  protein  structure. 
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a  cloned  genome  and  an  1 n  vitro  system  of  cultivating  virus 
has  permitted  extensive  analysis  of  the  function  of  the  viral 
genome  and  the  viral  proteins  for  which  it  codes. 

The  HIV  genome  consists  of  a  9.7  kilobase  RNA  molecule, 
in  which  eight  genes  have  been  Identified  to  date.  The  three 
structural  genes,  gag ,  po 1 ,  and  en v  are  common  to  all  retro¬ 
viruses  and  constitute  the  largest  open  reading  frames  of 
the  genome.  In  addition,  the  virus  contains  t at -I II  and 
art /t  rs  which  represent  regulatory  genes  for  virus  replica¬ 
tion,  and  three  genes  of  undefined  function  sor ,  3  *  o  r f ,  and 
R  (3). 

In  the  course  of  this  contract,  efforts  have  been  concen¬ 
trated  on  elucidating  the  function  of  the  nonst ructural 
genes,  particularly  t  at -II I ,  sor  and  3  *  o  r f .  Tat -1 1 1  will 
be  discussed  more  fully  in  Section  B  of  this  report.  The 
sor  gene  (for  short  open  reading  frame)  of  HIV  (also  called 
Q,  P_^,  Or  f  - 1  ,  and  Or  f  A )  lies  between  the  po  1  and  tat  genes 
overlapping  at  its  5'  end  with  p o 1  (4).  It  is  an  open 
reading  frame  of  609  nucleotides  in  size  and  encodes  a 
protein  of  23  Kd  (5).  This  gene  appears  to  be  conserved 
among  all  HIV  isolates,  in  the  distantly  related  SIV,  and 
in  other  retroviruses  suggesting  that  this  gene  may  be  func¬ 
tionally  significant.  Preliminary  studies  with  sor  deletion 
mutants  of  HIV,  however,  suggested  that  it  is  not  essential 
for  virus  replication  or  cytopathic  effects.  Although  small 
reductions  in  infectlvity  were  noted,  the  significance  of 
this  observation  was  not  well  understood. 

The  3  *  o  r f  gene  (for  3’  open  reading  frame)  has  the 
coding  capacity  of  a  protein  of  about  27  Kd  and  spans  the 
region  between  e n v  and  3'  LTR  overlapping  both  genes. 
Preliminary  studies  in  Dr.  Robert  Gallo's  lab  with  deletion 
mutants  in  3 ' orf  suggested  that  this  gene  may  be  important 
in  virus  Induced  cell  killing. 
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A  non  cytopathic  HTLV-III  clone,  designated  XlO-1,  was 
generated  from  the  cytopathic  genome  of  pHXB2D  by  removing  a 
200  base  pair  segment  around  the  Xhol  site  in  the  3'  region 
of  the  provirus.  This  deletion  removed  the  coding  capacity 
of  the  last  5  amino  acids  of  the  t r an s me  mb r a ne  portion  of  the 
envelope,  together  with  the  first  60  amino  acids  of  3  '  orf  , 
and  shifted  the  remaining  codons  of  this  gene  out  of  frame. 
This  clone  generated  replicating  virus  when  transfected 
into  normal  T-lymphocyt es  ,  and  T-ly mphob 1  a s t o i d  cell  lines. 
This  virus  is  no  longer  cytopathic  for  normal  T-cells,  and 
does  not  kill  ATH5  cells  which  rapidly  and  reproducibly 
succumb  to  HTLV-III  mediated  killing  by  conventional  virus 
isolates.  Interestingly,  clones  which  contain  less  extensive 
deletions  (  55-109  base  pairs),  restricted  to  3  *  or f ,  generate 


if 
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virus  which  is  extremely  cytopathic.  These  data  argue  that 
cell  killing  may  be  mediated  by  the  carboxy-port ion  of  HTLV-III 
envelope  or  by  the  first  22  amino  acids  of  3 '  orf »  Deletions 
of  the  3 ' or  f  gene  may  therefore  be  expected  to  lead  to  the 
production  on  non -cy t opa th  1  c  virus  particles. 

An  alternative  approach  to  producing  viral  particles 
which  are  non  replicating  is  to  generate  "packaging"  defective 
mutants  which  do  not  encapsidate  the  RNA  and  therefore  produce 
only  "empty"  particles.  Although  much  is  known  about  retro¬ 
viral  replication  and  the  processing  of  gag ,  pol ,  and  env 
products  in  Infected  cells,  the  precise  mechanics  of  virus 
particle  formation  and  the  mechanism  by  which  the  viral 
genome  is  prefentlally  packaged  into  virions  remains  unclear. 
Presumably,  out  of  the  array  of  viral  and  cellular  mRNA's, 
ribosomal,  and  transfer  RNA's  found  in  infected  cells,  this 
mechanism  permits  the  virus  assembly  apparatus  to  select  and 
incorporate  its  own  genomic  viral  RNA. 

Studies  in  the  avian  and  murine  retrovirus  system  have 
suggested  that  virus  particle  formation  can  occur  in  the 
absence  of  genomic  RNA  (7)  and  sequences  that  Intervene 
at  the  5'  LTR  and  gag  are  crucial  for  virus  specific  packaging 
(8,  9,  10).  Deletions  in  the  sequences  interviewing  at  5' 

LTR  and  ga g  regions  of  HIV-1,  therefore,  may  be  expected  to 
give  rise  to  "empty"  (non-viral  RNA  containing  particles,  not 
containing  viral  RNA). 

3 .  Rationale 

The  specific  goals  of  this  portion  of  the  contract  are 
to  produce  defective  HIV-1  particles  which  contain  the  viral 
structural  proteins,  but  are  non-inf ect  ious  and  non- 
cytopathlc.  To  accomplish  this,  it  is  first  necessary  to 
clearly  define  the  biological  activity  of  the  non-s t r uc t u ra 1 
proteins  and  the  role  they  play  in  Infectivity  and  cyto- 
pathogenicity.  One  approach  to  the  study  of  viral  function 
is  to  introduce  specific  mutations  in  a  biologically  active 
virus  genome,  and  examine  the  consequences  of  such  alteration 
upon  subsequent  transfection  into  recipient  cells.  Specific 
deletions  or  alterations  of  the  viral  genome  can  be  accom¬ 
plished  by  site  directed  mutagenisis  or  by  the  combined 
actions  of  exo-  and  endo-nucle ases . 

Three  portions  of  the  viral  genome  were  selected  for 
detailed  study:  the  s o r  and  the  3 ’ orf  reading  frames  and  the 
5'  end  of  the  genome  involved  in  packaging  recognition. 

As  discussed  in  the  Background  section,  mutations  in  s or  may 
be  expected  to  generate  particles  which  can  replicate  and 
induce  cpe,  but  which  have  reduced  Infectivity.  Deletions 
in  the  5'  end  of  gp41  are  of  special  Interest  in  this  regard. 


Mutations  in  3  '  or f  may  be  expected  to  be  defective  in  cell 
killing.  Finally  packaging  defective  mutants  may  produce 
"empty"  particles  Incapable  of  replication  or  inducing  cyto- 
pathlc  effect.  By  Introducing  a  series  of  mutations  at 
specific  sites  within  these  regions,  the  biological  activity 
of  resulting  clones  can  be  subsequently  determined  in  trans¬ 
fected  cells.  Assays  for  replication,  packaging,  and  cytopathlc 
effect  in  transfected  cells  need  be  designed  to  properly 
characterize  the  nature  of  each  mutation  defect.  Using  this 
type  of  strategy,  it  should  be  possible  to  precisely  localize 
the  genetic  sequence  and  function  responsible  for  infectivity, 
cy t opathogenl ci ty  and  packaging. 

4.  Experimental  Methods 


Sor  Deletion  Mutants  -  A  series  of  s o r  mutants  were  gen¬ 
erated  from  pHXB2gpt  (11)  by  removal  of  an  EcoRI  site  in  the 
polylinker  of  the  vector.  Mutant  S  was  prepared  by  removal 
of  the  sequences  between  the  Ndel  and  Ncol  restriction  sites 
(nucleotides  4707  to  5259)  and  religation,  additional  mutants 
were  prepared  by  site  specific  mutations  using  oligonucleotide- 
directed  mutagenesis.  These  mutations  introduced  translational 
stop  codons  at  various  points  In  the  sor  frame  downstream  of 
the  s  o  r  /  po 1  overlap  (Fig.  1).  To  construct  these  mutants 
the  EcoRI  to  EcoRI  fragment  (nucleotides  4230  to  5322)  of  the 
HTLV-III  genome  of  BHIO  was  subcloned  into  an  M13  phage 
vector,  and  mutagenesis  performed  as  described  previously  (12). 
Sequences  in  this  region  of  BHIO  differ  from  those  of  HXB2 
only  at  nucleotide  position  4506  which  results  in  a  proline 
to  serine  substitution  in  pol .  Mutation  6.9  was  introduced 
using  the  25-raer  ( GGATGAGGGCTTTCTTAGTG ATGCT ) ,  converting 
the  tyrosine  codon  (TAT)  at  residue  55,  into  a  stop  codon 
(TAA).  In  a  similar  approach  was  used  to  replace  the  serine 
codon  (TCA)  at  position  42  was  replaced  with  a  stop  codon 
(TAA)  in  clone  3.3,  and  to  change  the  glutamine  codon  (GAA) 
at  residue  100  to  a  stop  codon  (TAA)  in  clone  153.  Following 
confirmation  by  DNA  sequencing  the  mutated  fragments  were 
subcloned  into  the  original  proviral  clone. 

3  *  orf  Mutat ions  -  A  series  of  deletion  mutations  in  the  3 ’ o  r  f 
region  were  constructed  from  pHXB2-D  in  collaboration  with 
Lee  Ratner.  Purified  phage  DNA  was  cut  with  Xhol  and  sub¬ 
jected  to  Bal31  digestion.  Aliquots  were  removed  at  various 
times  of  digestion  and  religated  to  generate  a  series  of 
deletions  mutations  about  the  Xhol  site  (Fig.  2).  Deletions 
ranging  in  size  from  approximately  150-500  nucleotides  and 
extended  into  the  gp41  region  were  selected  for  further 
study. 

Packaging  Defective  Mutants  -  Packaging  defective  mutants 
were  made  by  producing  deletions  in  the  region  between  the 
5 ' L T R  and  gag  coding  sequences  of  the  plasmid  clone  pHXB2gpt. 
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This  plasmid  consists  of  a  biologically  active  UTLV-III 
genome  combined  with  the  E.  coll  xanthine  guanine  phosphono- 
ribosyl  phosphatase  gene  (gpt)  and  contains  a  unique  BssHII  site 
in  the  sequences  that  intervene  the  5'  LTR  and  the  beginning 
of  the  gag  gene.  A  series  of  deletion  mutants  about  the 
BssHII  site  were  generated  using  the  Bal  31  digestion- 
regulation  approach  (Fig.  3).  Mutants  pHXB2^  and  pHXB2^1 
are  deletions  of  51  and  57  nucleotides  in  positions  224-227  and 
224-283,  respectively.  Additional  mutants  were  obtained  by 
site  directed  mutagenesis.  The  mutant  XlO-l^^  g  deleted  of 

9  nucleotides  between  positions  310-320.  The  mutant  ’''293 
contains  deletions  between  293  and  328.  In  all  cases,  the 
splice  donor  site  (position  287)  is  preserved. 

Characterization  of  Deletion  Mutants  -  Purified  DNA  from 
the  various  deletion  mutants  produced  as  described  above  was 
transfected  into  permissive  human  lymphoid  cells  (H9,  Molt  4, 
or  PH A-s 1 1 mu  1  a t e d  blood  mononuclear  leukocytes)  or  to  the 
monkey  kidney  f i broblas toid  line  Cos-1  using  the  calcium 
phosphate  precipitation  technique  (13).  In  these  experiments 

10  ug  of  cesium  chloride  gradient  purified  plasmid  DMA  was 
introduced  into  1  x  10^  cells.  Since  the  plasmid  clones 
contain  the  gpt  gene  (which  allow  recipient  cells  to  grow  in 
the  presence  of  ml cr ophi no  1 i c  acid),  stably  transfected  cell 
lines  containing  the  transfected  genomes  could  be  selected  by 
growth  in  appropriate  media  (HAT).  The  transfected  cultures 
were  monitored  at  approximately  weekly  intervals  for  the 
appearance  of  HIV-1  gag  and  env  by  indirect  immunofluorescence 
with  antibodies  to  pl7,  p24,  and  gp41.  Expression  of  the 
functional  po  1  gene  product  was  determined  by  reverse 
transcriptase  assay.  The  presence  of  viral  particles  was 
examined  by  electron  microscopy. 

5.  Results 


Deletions  of  Sor 


DNA  from  each  of  the  sor  deletion  mutants  was  transfected 
i  ri  t  o  permissive  lymphoid  cells  (H9,  Molt4  and  PHA-st  imulat  ed 
blood  mononuclear  leukocytes),  and  the  transfected  cultures 
were  monitored  at  approximately  weekly  intervals  for  HIV-1 
expression.  Molt  4  cells,  normal  T-cells  or  H9  cells  trans¬ 
fected  with  the  deletion  sor  mutants  consistently  failed  to 
express  virus  as  detected  by  immunofluorescence,  reverse 
transcriptase  assays  or  electron  microscopy.  Southern  blotting 
analyses  on  these  samples  showed  transient  uptake  of  plasmid 
DNA  but  no  detectable  provlral  sequences  in  long-term  cultures. 
In  contrast,  cells  transfected  with  pHXB2gpt  reproduclbly 
and  rapidly  yielded  virus  producing  cells. 

The  ability  of  the  sor  mutants  to  produce  virions  was  also 
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examined  by  performing  a  series  of  transfections  using  the 
SV40  transformed  cell  line  Cos-1  as  a  target.  These  experi¬ 
ments  were  aimed  to  amplify  virus  production  by  exploiting 
the  capacity  of  the  Cos-1  cells  to  promote  episomal  replica¬ 
tion  of  plasmids  carrying  the  SV40  origin  of  replication 
(Including  plasmids  derived  from  pHXB2gpt)  in  transient 
assays.  The  transfected  Cos-1  cells  were  then  cocultivated 
with  Molt  3  cells  which  are  permissive  for  HIV-1  infection. 


As  shown  in  Table  1,  virus  particles  morphologically 
similar  to  wild  type  were  recovered  from  Cos-1  cell  cultures 
transfected  with  s  or  mutants  6.9,  3.3,  153,  and  S.  The 

level  of  virus  production  (both  extracellular  and  budding 
virions)  and  t r a  ns ac 1 1 va t ion  potential  was  indistinguishable 
from  that  seen  in  pHXB2-gpt  transfected  cultures.  However, 
supernatants  removed  from  s or  mutant  cultures  (containing 
cell  free  virus)  failed  to  Infect  H9  cells  in  repeated  attempts 
These  results  indicate  that  the  s  o  r  gene  of  HIV  is  not  critical 
for  the  production  of  normal  levels  of  morphologically  intact 
virus  particles,  but  mutant  viruses  are  limited  in  their 
capacity  to  establish  stable  infection. 


Virus  derived  from  transfection  of  Cos-1  cells  (0KT4-, 
African  Green  Monkey  Cells)  with  s o r  mutant  proviral  DNA's, 
was  resistant  to  transmission  to  0KT4+  'susceptible'  cells 
under  cell-free  conditions,  and  was  transmitted  poorly  by 
coculture.  In  contrast,  virus  derived  from  clones  with  an 
intact  so  r  frame  was  readily  propagated  by  either  approach. 
Normal  amounts  of  gag,  en v  and  p o 1  derived  proteins  were 
produced  by  all  four  mutant  genomes  and  assays  performed  in 
both  lymphoid  and  non-lymphoid  cells  indicated  that  their 
t r a  ns -a c 1 1 va t i ng  capacity  was  intact  and  comparable  with  wild 
type.  These  data  show  that  the  s or  gene,  although  not 
absolutely  required  in  HIV  virion  formation,  influences  virus 
transmission  in  vitro  and  is  crucial  in  the  efficient 
generation  of  infectious  virus.  This  data  also  suggest  that 
sor  Influences  virus  replication  at  a  novel,  pos t -t rans la- 
tlonal  stage  and  its  action  is  independent  of  the  regulatory 
genes  tat  and  art/trs. 


Deletions  in  3'orf 


A  total  of  36  HIV-1  mutants  with  deletions  in  the  3' 
region  of  the  genome  have  been  constructed  and  characterized. 
Of  these,  clones  369,  468,  372  and  429  were  generated  by  site 

directed  mutagenesis,  while  the  remainder  were  obtained  by 
Bal  31  digestion  at  the  Xhol  site.  The  results  of  sequence 
analysis  across  the  deleted  regions  of  these  plasmid  con¬ 
structs  have  now  been  completed  and  are  summarized  in  Table  2. 
Briefly,  these  mutants  fall  into  4  categories;  those  with 
deletions  solely  within  the  3'orf  regions  (clones  230,  330), 


clones  with  alterations  in  3  *  o  r f  and  gp41  en v  (clones  329 
through  362,  Table  2)  and  clones  with  an  additional  altera¬ 
tions  in  the  neighboring  3*  regions  (LTR  and  polypurine 
tract  clones  358-1  9A)  or  5'  regions  (tat-III  and  t  r s  clones 
360-327).  The  resulting  alterations  in  the  carboxyl  end  of 
gp41  is  Indicated  by  the  number  of  amino  acids  deleted  and 
the  number  of  additional  amino  acids  added  in  frame. 

Analysis  of  the  biological  properties  of  these  HIV-1 
mutants  has  been  carried  out  at  two  levels.  First,  the 
ability  of  mutant  genomes  to  give  rise  to  intact  virus 
particles  with  reverse  transcriptase  activity,  was  analysed 
by  transfecting  each  plasmid  into  Cos-1  cells  and  examining 
the  culture  supernatant  for  virion  production  by  EM  and  RT 
assays.  The  data  presented  in  Table  2  shows  that  all  constructs 
were  replication  competent  except  for  those  containing  dele¬ 
tions  in  the  tat-III  or  t r s  genes  (clones  306,  327  ).  This  is 
consistent  with  previous  reports  that  the  t  at  and  t  r s  genes 
are  essential  for  virus  replication.  A  second  analysis  was 
carried  out  to  determine  the  ability  of  env  mutants  to  be 
propagated  in  culture.  This  was  done  by  co-culturlng  trans¬ 
fected  Cos-1  cells  with  0KT4  positive  recipient  cells  (either 
Molt  3  or  H9  cells).  Although  the  analysis  is  only  half 
completed  preliminary  results  suggest  that:  (1)  relatively 
small  changes  in  the  carboxyl  region  of  gp41  env ,  diminish 
the  p r opaga t ion /inf ec t 1 vi ty  of  virus;  (2)  these  effects  are 
more  markedly  seen  when  Molt  3  cells  are  used  as  recipients 
rather  than  H9  cells;  and  (3)  deletions  which  enter  the  LTR 
region  results  in  virus  which  cannot  be  efficiently  propagated. 

Packaging  Mutants 

Studies  of  the  biological  properties  of  the  packaging 
mutants  have  not  been  completed.  However,  the  initial  analysis 
has  shown  that  clones  pHXB2^  and  pHXB2^^  generate  morpho¬ 
logically  normal  virus  particles  upon  transfection  into  the 
Cos-1  cell  line.  Furthermore,  these  particles  are  infectious 
(and  therefore,  presumably  contain  genomic  RNA)  and  can 
easily  be  transmitted  to  recipient  0KT4+  cells.  These  data 
argue  that  sequences  between  224-283  are  unlikely  to  be 
Important  in  the  process  of  viral  packaging.  Mutant  X10-l25^ 
and  other  clones  bearing  more  extensive  deletions  downstream 
of  the  splice  donor  are  currently  under  study. 

6.  Discussion 


Mutations  in  Sor 


The  studies  described  in  this  report  have  shown  that 
construction  and  analysis  of  deletion  mutations  is  a  very 
powerful  approach  to  elucidating  gene  function.  We  have 


shown  that  removal  or  truncation  of  s  o  r  results  in  virus 
progeny  that  have  a  much  reduced  (>100-fold)  capacity  to 
infect.  Furthermore  s  o  r  mutant  viruses  were  transmitted 
less  well  under  co-culture  conditions  (in  which  cell  to  cell 
transmission  Is  likely  to  be  important).  Interestingly,  the 
effects  of  truncating  so r  (in  the  case  of  mutants  3.3,  6.9, 

153)  were  similar  to  its  complete  removal  (in  the  case  of 
mutant  S)  suggesting  that  the  ca r boxy - 1 e r mi na 1  portion 

of  s or  (downstream  from  residue  100)  may  Include  a  function¬ 
al  domain.  However,  the  possibility  that  s o r  is  non-f un c t i ona 1 
because  it  is  deprived  of  critical  elements  in  the  carboxy- 
termlnal  of  the  protein  (necessary  for  the  correct  folding  or 
processing)  is  not  excluded.  Since  the  level  of  viral  RNA, 
proteins  and  viral  particles  produced  by  s o r  defective  genomes 
could  not  be  distinguished  from  that  of  wild  type,  we  suspect 
that  s  o  r  exerts  its  effects  at  a  pos t - 1 r ans 1  a t i ona 1  level. 

This  novel  regulatory  mechanism,  mediated  by  sor,  could 
involve  late  events  in  virus  maturation.  These  findings 
substantiate  the  complex  nature  of  the  HIV  genome,  underscoring 
the  sophisticated  transcriptional,  pos t -t r ans c r i p t i ona 1  and 
pos t -t r a  ns  la t i ona 1  controls  which  operate  to  regulate  virus 
expression  in  Infected  cells.  While  the  mechanism  of  how  sor 
enhances  virus  propagation  is  not  yet  understood,  several 
distinct  mechanisms  could  be  postulated  and  are  currently 
being  investigated.  It  is  possible  that  sor  is  a  structural 
component  of  the  virion  particle  which  acts  as  a  'second 
envelope',  required  for  efficient  transmission.  Since  it  is 
difficult  to  detect  sor  in  as  large  amounts  as  gpl20  and  gp41 
(in  either  infected  cells  or  virions)  additional  studies  are 
necessary  to  evaluate  this  likelihood.  Alternatively,  the 
sor  gene  might  be  Involved  in  stabilizing  or  processing 
envelope  so  that  assembly  of  infect ious  virus  is  Increased, 
or  in  potentiating  the  cellular  environment  in  which  viral 
replication  occurs. 

Mutation  in  3'orf 

Studies  with  deletions  in  the  3'orf  gene  have  shown  that 
this  gene  is  not  essential  for  virus  particle  formation  or 
reverse  transcriptase  activity.  However,  even  small  changes 
in  the  carboxyl  region  of  gp41  env  (which  overlaps  the  3'orf) 
greatly  diminish  viral  Infectivity.  Thus  virus  produced  in 
Cos-1  cells  which  amplify  the  gene  expression,  transmit 
poorly  to  H9  cells  and  even  less  well  or  not  at  all  to  Molt  3 
cells.  The  host  cell  itself  plays  a  large  role  in  this  decrease 
since  these  effects  are  more  pronounced  when  Molt  3  cells  are 
used  as  recipients  rather  than  H9  cells. 

The  cytopathic  properties  of  the  3'orf  mutants  is  currently 
being  evaluated  using  the  sensitive  ATH8  cells  or  PHA  stimulated 
normal  T-cells  as  targets.  Immunop r ec 1 p 1 t a t ion  studies  are 


also  being  performed  to  visualize  the  truncation-elongation 
of  env  derived  gp41,  gpl20,  and  gpl60  (precursor)  proteins, 
created  by  the  mutations.  Finally,  the  interactions  of  gp41 
and  HLA  class  I  molecules  will  be  Investigated  to  try  and 
see  whether  the  diminished  propagation  of  gp41  mutants  in 
Molt  3  cells  might  result  from  their  failure  to  Interact 
with  class  I  determinants  as  has  been  suggested  from  studies 
by  Fulvia  Veronese  (Litton  Bionetics)  and  collaborators 
( di Mar z o -Ve rone s e  ,  unpublished). 

Packaging  Mutants 

The  preliminary  characterization  of  the  5'  deletion 
mutants  of  sequences  between  223  and  284  have  failed  to 
detect  alterations  in  virus  morphology  or  infectlvity. 
Additional  characterization  of  these  mutants  and  of  those 
containing  more  extensive  deletions  are  currently  in  progress. 
The  specific  infectlvity  (infectious  centers/ug)  will  be 
determined  on  Cos-1  cells.  Transfections  of  various  lymphoid 
cells  will  be  attempted  to  generate  lines  of  infected  cells. 


These  stable  cell  lines,  will  be  characterized  in  terms 
of  viral  protein  expression  (Western  blotting,  Immuno- 
precipitation,  and  immunofluorescence),  expression  of  viral 
RNA  species  (Northern  blotting)  and  the  capacity  to  generate 
virion  particles.  To  determine  whether  these  particles 
contain  virus  specific  RNA  (i.e.  are  correctly  'packaged')> 
Northern  blotting  of  sucrose  banded  'purified*  virus  particles 
will  be  attempted  using  a  BHIO  insert  labeled  with  as  a 

probe,  and  purified  "wild  type"  virus  as  a  positive  control. 
The  amount  of  viral  RNA  present  per  virion  particle  will  be 
quantitated  to  verify  whether  'empty'  particles  are  in  fact 
produced . 


MOLECULAR  STUDIES  OF  TAT-III  ACTIVITY 


I 


1.  Statement  of  Problem 


The  level  of  HIV  gene  expression  In  infected  cells  is 
subject  to  a  complex  set  of  virus  encoded  regulatory  ele¬ 
ments.  Infection  by  HIV-1  i n  vivo  may  be  associated  with 
an  asymptomatic  Interval  frequently  lasting  from  months  to 
years  before  the  development  of  actual  disease  (14).  Such 
a  latent  state  of  Infection  can  also  be  duplicated  in  vitro, 
where  HIV-1  infected  CD4+  cell  lines  can  be  maintained  for 
long  periods  of  time  in  the  absence  of  detectable  virus 
production  (15).  Events  that  trigger  the  transition  from 
low  level  or  latent  Infection  to  productive  viral  replica¬ 
tion  remain  poorly  defined.  Immune  activation  of  infected 
T-cells  can  greatly  stimulate  HIV  replication,  apparently 
through  effects  on  the  regulatory  elements  within  the  long 
terminal  repeat.  An  understanding  of  the  mechanism  of  HIV-1 
gene  activation  is  essential  for  designing  therapeutic 
approaches  to  control  the  onset  of  the  disease  in  HIV-1 
antibody  positive  individuals.  This  Information  will  also 
have  major  implications  in  the  design  of  'safe'  vaccines 
in  which  the  genome  will  not  be  subject  to  rapid  activation. 

2 .  Background 

Expression  of  the  HIV-1  genome  is  tightly  regulated  by 
the  activity  of  several  viral  genes  and  regulatory  sequences. 
The  t  at -1 1 1  gene  located  between  the  s  or  and  e  nv  genes  has 
been  reported  to  enhance  transcription  of  genes  linked  to  the 
long  terminal  repeats  LTR's  in  Infected  cells  (tat  response), 
and  presumably  plays  a  central  role  in  HIV-1  gene  activation 
(16).  The  gene  is  expressed  as  a  1. 9-2.0  kb  mRNA  generated 
by  two  splicing  events  and  has  many  similarities  to  the 
corresponding  tat  gene  in  HTLV-I  and  HTLV-II  (17).  however, 
whereas  the  tat  I  and  II  proteins  are  reported  to  enhance 
transcription,  the  tat  III  gene  product  was  shown  to  enhance 
expression  of  protein  at  a  pos 1 1 r ans c r ip t i on  a  1  level  (18). 
Another  gene  designated  art  or  t  r s ,  possibly  expressed  from 
the  same  mRNA  which  encodes  the  tat-III  gene,  also  regulates 
HTLV-III  virus  expression  pos t t r ans c r 1 p t i o na 1 1 y ,  most  likely 
by  regulating  of  the  accumulation  of  genomic  and  spliced 
viral  mRNA  (19). 

The  level  of  gene  expression  has  been  reported  to  be 
dependent  on  functional  elements  within  the  LTR  sequence  of 
the  virus,  including;  (1)  a  negative  regulatory  element  (NRE), 

(ii)  an  enhancer  upstream  of  the  promoter  or  TATA  box,  and 

(iii)  tat  response  region  downstream  of  the  promoter  at  -17 

to  +80  (20).  The  tat  response  region  allows  greatly  Increased 
expression  of  linked  genes  in  HTLV-III  infected  cells  relative 
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to  uninfected  cells  even  If  placed  In  a  similar  position 
downstream  of  heterologous  promoters  (20).  The  HTLV-III-LTR 
was  also  reported  to  contain  three  Spl  binding  sites  as 
demonstrated  by  point  mutagenesis  and  protection  experiments 
(21).  However,  the  functional  discrimination  of  these  sites 
was  only  partially  dissected.  The  HIV-1  LTR  contains  both 
ci s  and  t  r ans  regulating  elements  which  have  not  been  pre¬ 
cisely  localized.  Much  work,  remains  to  be  carried  out  in 
defining  the  function  of  the  regulatory  elements  and  their 
interactions  with  the  viral  genome. 


3 .  Rationale 


A  series  of  deletions  within  the  HTLV-III  LTR  was  intro¬ 
duced  to  localize  the  c i s  acting  elements  upstream  of  the 
promoter  within  0  3  and  to  localize  the  t  r ans  acting  response 
sequences  in  the  R  region.  To  study  the  resulting  LTR  func¬ 
tion  of  these  alterations,  these  constructs  were  linked  to 
the  chloramphenicol  acetyl  transferase  gene  (CAT)  for  assay 
by  transfection  Into  HTLV-III  Infected  and  uninfected  cells. 
The  location  of  els  acting  regulatory  elements  could  then  be 
deduced  from  the  analysis  of  deletion  clones  in  LTR  which 
fall  to  produce  a  CAT  expression.  Trans  acting  regulatory 
elements  can  be  identified  by  co-t rans f ect 1  on  of  LTR-CAT  with 
other  plasmid  constructs  not  linked  to  CAT  but  containing 
deletions  in  LTR,  t  at  or  a  rt  / 1  r s . 

4 .  Experimental  Methods 

Plasmid  Clones 

Plasmid  pSVOCAT,  pSV2CAT,  and  RSVCAT  were  obtained  from 
Dr.  B.  Howard  (22).  Clone  pC15CAT  was  constructed  by  blunt 
ending  the  Pst  I  cDNA  insert  of  clone  pC15  (17)  with  T4 
DNA  polymerase  and  the  four  NTPs  (Fig.  4).  Hind  III  linkers 
were  added  and  the  resulting  fragment  was  cloned  Into  the 
Hind  III  site  of  pSVO.  DNA  sequence  analysis  confirmed  that 
the  entire  3'  region  of  the  C15cDNA  insert  was  present  in 
C15CAT  including  the  poly  A  tall.  VHHCAT  was  constructed  by 
cloning  the  most  3'  Hind  III  fragment  of  the  HXB2  provlrus 
(24)  into  the  Hind  III  site  of  PSVO. 

Synthetic  Oligonucleotides 

Oligonucleotides  were  kindly  synthesized  by  L.  Lee  of 
Program  Resources  Inc.,  Frederick,  MD. 

Construction  of  3'  deletion  Mutants 


The  clone  -117  (Fig.  4)  was  cleaved  with  either  Bgl 
II  +  Sst  I  to  create  -117  BS  or  Sst  I  for  -117  S.  The  DNAs 
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were  blunt  ended  as  described  above  then  self  ligated  and 
transfected  into  the  bacterial  strain  HBlOl  (BRL).  Clone 
-117  +56  was  created  by  ligating  an  Sst  I  to  Hind  III 
synthetic  oligonucleotide  fragment  containing  the  sequences 
+39  to  +56  from  the  CAP  site  into  the  Sst  I  and  Hind  III 
sites  of  clone  -117. 


CAT  Assays 


CAT  assays  were  performed  as  described  previously  (24) 


DNA  Sequence  analyses 


The  5'  deletion  clones  were  cleaved  with  Xba  I.  The  5' 

Xba  I  ends  were  labeled  in  a  reaction  with  T4  polynucleotide 
kinase  and  ^^P-ATP  (7000  Ci/mmole,  New  England  Nuclear)  and 
then  cleaved  with  Hind  III.  The  Xba  I  to  Hind  III  fragments 
were  purified  from  6%  acrylamide  gels  and  sequenced  by  the 
Maxam- Gi 1  be r t  method.  The  -65E  and  -48E  clones  are  cleaved 
with  Bgl  II,  5'  end  labeled  as  above  and  then  cleaved  with 
Hae  II.  The  Bgl  II  to  Hae  II  fragments  of  each  were  purified 
as  above  and  then  sequenced  by  the  Maxam-Gllbert  method  (25). 

5 .  Results 

LTR-directed  expression  in  Infected  and  uninfected  cells 

The  5'  deletion  clones  (Fig.  5)  were  transfected  into 
H9  cells  or  H9/III  cells  and  CAT  assays  performed  on  the 
cell  lysates  within  40-48  hours  post  transfection.  The 
results  of  the  assays  are  shown  for  each  clone  in  Fig.  5. 

The  first  column  shows  values  for  H9  cells  and  the  second 
column  shows  the  results  in  H9/III  cells.  The  much  increased 
level  of  CAT  activity  in  infected  cells  is  apparent  from 
the  reaction  conditions.  The  RSVCAT  plasmid  (values  in 
parenthesis)  served  as  a  positive  control  in  these  experiments 
for  both  transfection  efficiency  and  CAT  expression.  The 
values  for  infected  cells  show  >1000  fold  activity  as  opposed 
to  uninfected  cells  for  clones  containing  complete  LTRs. 


Deletions  to  position  -117  showed  no  significant  dif¬ 
ferences  (<2  fold  variation)  from  CD12CAT  in  either  infected 
or  uninfected  cells.  However,  the  activity  of  clone  -103 
activity  in  uninfected  cells  was  reduced  by  a  factor  of  4-9 
fold  while  values  for  infected  cells  did  not  significantly 
change.  Further  deletions  to  -65  and  -48  resulted  in  loss 
of  activity  in  uninfected  cells  and  either  partial  or  complete 
loss  of  activity  in  Infected  cells  respectively. 


Negative  regulatory  element  were  not  detected  in  the 
U3  region  In  this  study  In  contrast  to  previous  studies  (19). 
The  reasons  for  this  are  unclear.  The  6  fold  suppression 
of  activation  reported  would  still  allow  greater  than  1000 
fold  activity  in  HTLV-III  infected  cells  relative  to 
uninfected  cells  reflecting  in  our  opinion  a  minor  role 
for  this  element  even  if  it  existed. 


Localization  of  enhancer  elements 

The  loss  of  activity  in  uninfected  cells  upon  deletion 
of  sequences  between  -103  and  -65  suggested  that  an  enhancer 
element  might  be  located  within  this  region.  To  test  this 
hypothesis,  an  oligonucleotide  spanning  -104  to  -80  was 
synthesized  with  Xba  1  compatible  ends  and  cloned  into  the 
Xba  1  site  of  -65  and  -48.  The  resulting  clones  were  assayed 
for  functional  restoration.  The  results  of  the  CAT  analyses 
of  these  clones  are  shown  in  Table  3.  Clone  -65E2  contained 
the  sequences  in  the  5'  to  3'  orientation  while  clone  -65E5 
contained  the  reversed  orientation.  The  direct  repeats  were 
inserted  ahead  of  -48  in  the  5’  to  3'  orientation  in  clones 
-48E9  and  -48E14  and  in  the  3'  to  5’  orientation  in  clone 
-48E8.  DNA  sequencing  confirmed  that  the  clones  contained 
the  entire  -105  to  -80  region  except  -48E14  which  is  missing 
base  pair  -80. 

In  all,  plasmids  -65E2  and  -65E5  were  positive  within 
or  above  the  range  seen  for  the  fully  active  LTR  sequences 
of  CD12  while  -65  (data  not  shown)  and  the  clones  -48E8, 

-48E9  and  -48E14  were  approximately  10  fold  less  active. 

Since  the  activities  seen  with  these  clones  did  not  depend  on 
the  orientation  of  the  inserts  the  direct  repeat  region  has 
the  property  of  the  enhancer. 

The  tat  response  region  (TAR) 

Previous  studies  have  shown  that  heterologous  promoter 
and  enhancer  sequences  could  be  placed  upstream  of  the  -17 
to  +80  region  of  the  HTLV-III-LTR  and  be  activated  1000  fold 
(19).  We  have  made  further  deletions  in  this  region  to 
more  precisely  define  the  elements  which  are  responsible  for 
the  transactivation  phenomenon.  For  this  we  used  the  tat 
responsive  clone,  -117,  to  construct  a  deletion  from  the 
Bgl  II  site  at  +20  to  the  Sst  I  site  at  +38.  The  clone, 

-117  BS,  lacked  tat  responsiveness  but  displayed  normal 
levels  of  expression  in  H9  cells  as  shown  in  Table  3.  Thus 
sequences  intermediate  of  Bgl  II  and  Sst  I  were  indispensable 
for  tat  response.  Furthermore  a  deletion  of  four  base  pairs 
of  the  Sst  1  site  GAGCTC  at  +34  was  also  made.  This  clone, 


-117  S,  had  no  tat  response  when  transfected  into  H9/III 
cells  (Table  3).  Identical  results  were  obtained  when  clones 
-117  BS  or  -117  S  were  transfected  into  nonlymphoid  COS-1 
cells. 

To  further  delimit  the  region  necessary  for  tat  response 
shown  here  to  extend  downstream  of  base  pair  +38,  a  17  base 
pair  oligonucleotide  of  sequences  from  +39  to  +55  with  5' 

Sst  I  and  3'  Hind  III  insertlble  ends  was  synthesized  and 
cloned  into  the  Sst  I  (+38)  and  Hind  III  (+79)  +55.  The 
resulting  clone,  -117  +56H  was  restored  for  tat  response 
(data  not  shown).  Inspection  of  the  sequences  in  the  region 
of  the  Bgl  II  and  Sst  I  sites  for  significant  features 
revealed  an  inverted  repeat  sequence  of  11  and  10  base  pairs 
at  positions  +12  and  +40.  An  8  bp  directly  repeated  sequence 
(CTCTCTGG)  was  also  present  at  +5  and  +37.  Simply  preserving 
the  Inverted  repeat  sequences  as  in  -117  BS  was  not  sufficient 
for  maintaining  tat  responsiveness.  The  larger  stem  loop 
structure  may  be  important  because  of  the  spacing  between  the 
inverted  repeat.  On  the  other  hand,  the  second  CTCTCTGG 
sequence  was  disrupted  in  both  -117  S  and  -117  BS  and  it  is 
possible  that  this  sequence  may  be  important  for  tat  response. 

6 .  Discussion 

The  experiments  described  here  have  identified  an  enhancer 
element  of  26  base  pairs  within  the  HIV-LTR.  This  enhancer 
is  localized  at  position  -105  to  -80  from  the  CAP  site.  Since 
the  activities  seen  with  the  clones  in  this  region  did  not 
depend  on  the  orientation  of  the  inserts,  the  direct  repeat 
region  has  the  property  of  the  enhancer.  A  perfect  homology 
of  the  10  base  pair  direct  repeat  was  found  in  the  SV40  72 
base  pair  enhancer  region  in  the  sense  of  the  late  mRNA  and 
in  the  18  base  pair  repeated  motif  present  in  the  cytomegalo¬ 
virus  major  Immediate  early  gene  promoter  (26).  Since  the 
corresponding  region  of  HTLV-III  is  biologically  active, 
these  elements  may  have  a  similar  biological  role  in  the  regula¬ 
tion  of  cytomegalovirus  and  SV40  expression  as  well.  Our 
analyses  show  that  deletion  of  the  first  G  of  the  enhancer  is 
not  critical  since  clone  -103  is  active.  Mutational  analyses 
of  the  homologous  sequences  in  SV40  showed  that  the  last 
cytosine  residue  is  critical  for  SV40  enhancer  function.  It 
will  be  of  Interest  to  check  similar  mutations  in  HTLV-III 
for  activity. 

The  presence  of  sequences  at  -65  were  found  to  be  crucial 
for  high  level  activity  along  with  the  enhancer.  Sequences 
within  the  region  are  known  to  bind  the  nuclear  factor  Sp-1. 

The  presence  of  viral  encoded  potentiators  in  the  H9/III 
cell  system  (ie  the  tat  III  protein)  may  provide  a  powerful 
means  of  measuring  very  subtle  changes  in  low  level  trans- 


criptional  activity  (see  below).  Alternatively,  tat  III 
activation  may  include  an  interaction  with  Spl  or  its 
target  sequence.  These  possibilities  are  being  investigated 
in  cell  free  transcription  systems  and  DNA  binding  experiments 
We  conclude  that  the  major  regulatory  elements  Involved  in 
the  region  upstream  from  the  HTLV-III  promoter  consist  of 
an  enhancer  region  from  -104  to  -80  which  acts  cooperatively 
with  the  Spl  binding  site  at  -65,  site  II. 


The  region  downstream  of  the  promoter,  the  TAR  or  trans¬ 
acting  response  region  is  necessary  for  elevated  levels  of 
CAT  expression  in  HIV  infected  cells  relative  to  that  in 
uninfected  cells.  The  TAR  region  had  been  previously 
mapped  to  position  -17  to  +80,  but  the  experiments  described 
here  indicate  that  sequences  downstream  of  +49  a-e  not 
essential  for  tat  response. 

It  has  been  demonstrated  recently  that  a  major  effect 
of  tat  activation  occurs  post  transcriptionally  (18).  We 
report  that  any  structural  feature  in  the  HTLV-III  mRNA 
which  would  play  a  role  in  tat  response  would  have  to  be 
located  upstream  of  +55.  It  is  conceivable  that  the  tat 
protein  or  cellular  factors  induced  by  the  virus  could  bind 
to  such  a  structure  and  facilitate  post  transcriptional 
enhancement  of  expression. 
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C.  MOLECULAR  STUDIES  OF  STLV-III, 
RETROVIRUSES 


(SIV)  AND  OTHER  RELATED 


1.  Statement  of  Problem 


Acquired  Immune  Deficiency  Syndrome  (AIDS)  is  caused  by 
infection  with  the  human  retrovirus  called  human  T-Cell 
lymphotropic  virus  type  III  (HTLV-III)  or  human  immuno¬ 
deficiency  virus  (HIV-1).  Epidemiological  studies  indicate 
that  the  virus  originated  in  Africa.  The  discovery  of  a 
retrovirus  distantly  related  to  HIV  which  infects  a  major 
population  of  African  Green  Monkeys  has  led  to  the  hypothesis 
that  the  simian  virus  (SIV)  may  be  the  ancestor  of  HIV  or 
that  the  two  viruses  might  share  a  common  ancestor.  Although 
SIV  is  apparently  non-pathogenic  in  African  Green  Monkeys,  it 
preferentially  Infects  and  kills  0KT4+  lymphocytes  l_n  vitro 
in  the  same  manner  as  HIV-1  and  it  can  induce  an  AIDS-like 
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disorder  in  captive  macaques.  A  fourth  human  retrovirus  has 
been  Isolated  independently  from  West  African  patients  by 
several  laboratory  groups.  This  virus,  designated  HIV-2 
demonstrated  a  close  relationship  to  SIV  and  a  distant  rela¬ 
tionship  to  HIV-1.  This  virus  is  tropic  but  non-cyt opathi c 
for  0KT4+  lymphocytes  Iji  vitro  and  is  reportedly  non-pathogenic 
in  vivo. 


Several  distinct  proviral  clones  of  HIV-1  have  been  shown 
to  be  infectious  and  cytopathic.  The  nucleotide  sequence  of 
several  of  these  HIV-1  isolates  have  been  determined.  Analysis 
of  the  DNA  sequence  showed  the  existence  of  at  least  eight 
genes:  gag ,  pol ,  env ,  s  or ,  3  *  orf  >  art /trs  and  t  at -3 .  It 
is  Imperative  that  the  structure  and  function  of  each  of 
these  genes  and  their  products  be  defined.  In  order  to 
understand  the  evolution  and  function  of  the  HIV  virus,  the 
molecular  structure  and  functional  regions  of  the  new  viral 
Isolates  STLV-III^Qf^  (SIV)  and  HIV-2  must  be  determined.  Since 
SIV  and  HIV-2  viruses  have  been  shown  to  be  serologically 
c r os  8 -r ea ct i ve  with  some  HIV-1  proteins,  they  offer  valuable 
comparative  models  for  HIV-1  pathogenesis  with  direct  implica¬ 
tions  in  the  development  of  therapeutic  and  vaccine  reagents. 


2.  Background 


Since  1981,  when  the  first  cases  of  the  acquired  immuno¬ 
deficiency  syndrome  (AIDS)  were  reported  (27),  two  new  groups 
of  human  retroviruses  have  been  discovered  and  characterized. 
The  first,  HIV-1  (28,  29,  30),  is  prevalent  in  AIDS  cases 

worldwide  and  seem  to  be  the  primary  cause  of  the  severe 
T-cell  depletion  observed  in  patients  with  AIDS.  The  existence 
of  the  second  group  of  human  retroviruses  was  first  shown  by 
a  serological  study  of  a  healthy  West  African  population 
using  as  target  antigens  the  proteins  of  a  retrovirus  dis- 


covered  in  African  Green  Monkeys  (SILVIIIy^pj^^)  by  the  sero¬ 
logical  c r os s - r ea c t i V i t y  of  its  major  core  protein  with  that 
of  HIV-1  (31).  Several  viral  isolates  have  been  obtained 
from  the  peripheral  blood  of  infected  people  and  designated 
HTLV-IV  (32).  Simultaneously,  a  group  of  West  African  patients 
with  frank  AIDS,  were  found  to  be  Infected  with  a  virus 
related  to  STLV-III  (33,34)  but  not  to  HIV-1.  A  third  group 
(35)  reported  the  isolation  from  a  West  African  patient  of  a 
new  retrovirus  which  was  closely  related  to  STLV-IIIy^Q^  but 
distantly  related  to  HIV-1.  This  virus  was  designated  as 
SBL6669  and  is  a  member  of  the  HIV-2  group  of  human  retroviruses 

All  of  these  new  viruses  are  i mmu no  1 og i ca 1 1 y  related;  where¬ 
as,  they  are  only  distantly  related  to  HIV-1.  Like  HIV-1  (36, 
37),  they  appear  to  use  the  CD4  molecule  as  a  receptor  on  the 
cellular  membrane  (31,32,33).  Direct  experimental  evidence, 
though,  has  been  provided  only  for  STLV-III^q^  (38).  We  have 
performed  molecular  analysis  of  several  of  these  human  and  non¬ 
human  primate  retroviruses  and  have  found  them  to  be  closely 
related.  S  LTV- 1 1 1  is  essentially  identical  to  some  of  the 

HTLV-IV  Isolates  (38,39,40);  whereas,  extensive  polymorphism  has 
been  demonstrated  among  other  HTLV-IV  (B.  Hahn  and  G.  Shaw, 
personal  communication)  and  HIV-2  isolates  (34,41).  The 
Inferred  amino  acid  sequence  of  the  proteins  of  STLV- 1 1 1 
(41,42)  exhibits  an  overall  40%  amino  acid  homology  with  HIV-1. 
The  HIV-2  isolate  (43)  is  as  distant  from  HIV-1  (42%)  by 
amino  acid  sequence  homology  as  STLV-III^qI^.  Conversely, 
H1V-2j.q^  shares  a  much  closer  (72%)  amino  acid  homology  with 
STLV-III^q[^  (44)  indicating  that  STLV-I I  is  a  member  of 

the  HIV-2  group. 

3 .  Rationale 

What  is  not  currently  clear  is  the  degree  and  manner  of 
divergence  among  various  HIV-2  isolates.  This  is  a  critical 
issue  since  the  pathogenic  potential  of  these  viruses  is 
currently  being  debated.  In  order  to  Investigate  the  rela¬ 
tionship  among  different  HIV-2  isolates  we  have  molecularly 
cloned  SIV  and  a  member  of  the  HIV-2  group.  The  nucleic  acid 
sequence  of  SIV  and  HIV-2jji}{_2  been  obtained  and  compared 

with  HIV-2j.o{j,  HIV-1,  and  the  ungulate  retroviruses  Visna  and 
EIAV.  A  comparison  of  the  nucleotide  and  amino  acid  sequences 
of  the  new  viral  Isolates  with  each  other,  HIV-1  and  other 
retroviruses  should  help  to  determine  the  evolution  of  HIV 
and  will  aide  in  the  Investigation  of  the  biological  proper¬ 
ties  of  this  group  of  retroviruses. 
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Experimental  Methods 
Cloning  of  Proviral  DNA 

High  molecular  weight  DNA  was  obtained  from  the  HIV-2 
infected  cell  line  HUT  78  and  partially  cleaved  with  BamHl. 
Different  fragments  of  DNA  were  size  selected  on  a  sucrose 
gradient  and  the  DNA  fraction  containing  20  Kb  fragments  was 
purified  by  ethanol  precipitation.  The  DNA  was  ligated  into 
the  arms  of  EMBL-3  vector,  previously  cleaved  with  BamHl  and 
the  ligated  DNA  was  packaged  i n  vitro.  The  recombinant 
phages  were  plated  and  screened  using  as  a  probe  the  SS35  and 
B16  plasmids  containing  the  env  and  gag  sequences  of 
STLV-IIIy^Qju[,  respectively  (41). 

The  proviral  DNA  of  STLV-I I  was  obtained  from  a  genomic 

library  constructed  from  the  DNA  of  the  Infected  cell  line 
K6W  using  the  lambda  phage  vector  EMBL-3.  Nine  clones  con¬ 
taining  various  overlapping  portions  of  the  STLV-IIIy^f^j^ 
genome  were  purified  and  characterized  by  restriction  endo¬ 
nuclease  analysis.  Five  of  them  were  used  to  generate  viral 
DNA  fragments  for  subcloning  in  appropriate  vectors.  Standard 
DNA  cloning  methods  were  used  as  in  Maniatis  et  al.,  1981  (45). 

DNA  Sequencing 

DNA  restriction  fragments  of  the  provirus  were  removed 
from  the  genomic  phage  clones  and  were  subcloned  into  the 
appropriate  enzyme  sites  of  the  plasmid  vector  Bluescrlbe. 

Other  subclones  were  generated  by  inserting  DNA  restriction 
enzyme  fragments  of  HIV-2fjiH-Z  into  M13 

bacteriophages,  rap8  and  rap9.  The  DNA  sequence  was  obtained 
by  the  dideoxy  chain  termination  procedure  (46)  using  synthetic 
primers  and  both  the  Klenow  fragment  of  E.  coll  polymerase 
and  the  T7  DNA  polymerase  on  single  and  double  stranded  DNA. 
Approximately  3  KB  of  the  HlV-2jjj{^_2  proviral  DNA  was  also 
sequenced  by  the  chemical  degradation  method  of  Maxam  and 
Gilbert  (25).  The  nucleotide  sequence  was  organized  with  the 
help  of  the  computer  program  Mlcrogen  (47). 

5 .  Resu Its 

a.  Genetic  Analysis  of  STLV-IIlAnM  (SIV) 

The  endonuclease  restriction  map  of  the  STLV-IIIy^Q^^  pro¬ 
virus  derived  from  overlapping  phage  clones  is  depicted  in 
the  lower  part  of  Figure  6.  The  DNA  sequence  of  the  entire 
provirus  was  obtained  by  the  dideoxy  chain  termination  method 
and  primer  extension  using  both  single  stranded  and  double 
stranded  template  DNA  (46).  The  complete  sequence  of  the 
STLV-IIIaqh^  provlrus,  derived  by  combining  the  DNA  sequence  of 


the  lambda  phage  clones,  Is  presented  in  Figure  2  and  is 
numbered  from  Che  5'  boundary  of  the  R  region  of  the  upstream 
LTR  to  the  3'  end  of  the  R  region  of  the  downstream  LTR.  The 
overall  homology  at  the  nucleotide  sequence  level  between 
STLV-III^q^  and  HIV-1  is  55%. 

Long  Terminal  Repeat  (LTR) 

The  STLV-III^QVj  LTR  is  800  base  pairs  long.  To  define  the 
boundaries  of  the  LTRs,  we  compared  the  junction  sequence  of 
the  'J3-cellular  sequence  of  the  5'  LTR  and  the  U3-3'-  orf 
sequence  at  the  3'  end  of  the  virus.  Similarly,  the  end  of 
the  L’5  was  identified  by  comparing  the  sequence  of  the  3' 
viral-cellular  junction  and  the  5'  LTR -gag  junction.  The 
boundaries  of  Che  R  region  with  respect  to  the  U3  and  U5  were 
determined  by  comparison  with  the  HIV-1  LTRs,  (48)  and  the 
sequence  of  several  S  T  L  V- 1 1 1  cDNA  clones  (Colombini-Hatch 

et  al.,  personal  communication).  The  sizes  of  the  U3,  R  and 
U5  are  498,  176  and  126  base  pairs,  respectively.  The  sequence 

TAT A A,  found  25  nucleotides  upstream  from  the  start  of  trans¬ 
cription  in  many  eukaryotic  genes,  is  located  in  the  U3  at 
position  -25  from  Che  presumed  boundary  of  the  U3  and  R 
regions.  The  consenus  sequence,  AATAAA,  which  signals  the 
addition  of  the  po 1 y ade ny la t e  tall,  is  located  at  position 
152  to  position  157. 

The  primer  binding  site  (PBS),  located  downstream  from 
the  U5  of  the  5'LTR,  is  complementary  to  the  same  Isoaccepting 
form  of  transfer  RNA  (tRNAjyg3)  as  the  PBS  of  HTLV-IIIB  and 
other  HIV-1  strains  (48),  Surprisingly,  in  view  of  the  close 
relatedness  of  these  viruses,  tne  STLV- 1 1 1  LTR  is  much 

longer  than  the  HIV-1  LTR  (800  versus  634  base  pairs).  The 
STLV-III^q^  and  HIV-2^qj  LTRs  are  closer  in  size  Co  the  HTLV-I 
(49)  and  HTLV-II  LTRs,  and  are  much  larger  than  the  LTRs  of 
the  ungulate  lentiviruses.  The  greater  size  of  the  STLV-III^q^ 
LTR  as  well  as  the  HIV-2  LTR  relative  to  that  of  HIV-1  is  due 
to  the  presence  of  several  regions  in  the  STLV-III^q^  of 
sequences  not  found  in  the  HIV-1  LTR. 


The  gag  and  pol  genes 

Retroviral  core  proteins  are  derived  from  the  proteolytic 
cleavage  of  a  polypeptide  precursor  encoded  by  an  open  reading 
frame  at  the  5'  end  of  the  genome.  The  first  large  open 
reading  frame  in  the  STLV-III^q]^  genome  starts  at  nucleotide 
539  (Figure  7)  and  could  encode  506  amino  acids.  The  length 
of  the  gag  precursor  polypeptide  reported  for  different  HIV-1 
strains  varies  between  500  and  512  amino  acids.  Alignment  of 
STLV-IIIy^Q^  and  HTLV-IIIB  amino  acid  sequences  suggests  that 
its  pl7  homology  contains  135  amino  acids  and  is  likely 
generated  by  cleavage  between  Tyr  and  Pro  at  position  941-946. 
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Prollne  is  the  first  amino  acid  at  the  N-terminus  of  HIV-1 
p24,  the  p26  of  the  ungulate  lentiviruses  EIAV  and  Visna,  the 
HTLV-I  p24,  and  the  HIV-2j.qJ  p26,  and  is  therefore  very 
likely  to  occur  at  the  same  position  in  the  STLV-I I  p24. 

The  overall  homology  of  the  gag  precursors  of  STLV-III^q(^  to 
those  of  HTLV-IIIB  and  HIV-2j.qjJ  is  51%  and  82%  respectively 
(see  Table  4).  The  highest  amino  acid  identity  can  be  found 
between  the  major  core  proteins  of  STLV-I I  and  HTLV-IIIB 

(66%)  and  HIV-2j.q(J  (88%).  These  data  are  consistent  with 
the  strong  cross  reactivity  of  these  proteins  in  radlolmmune 
assays  and  Western  Blots.  A  comparison  of  the  amino  acid 
sequences  of  the  major  core  protein  of  the  lentiviruses 
equine  Infectious  anemia  virus  (EIAV)  and  visna  with  the 
STLV-II  and  HIV-1  gag  proteins  shows  a  significantly  higher 

homology  of  EIAV  to  STLV-IHagm  (29%)  and  HIV-1  (29%),  than 
of  visna  to  STLV-IIl^Gf^  (20%)  or  HIV-1  (19%). 


At  the  carboxy  terminus  of  the  HTLV-IIIB  gag  precursor, 
two  repeated  sequences  have  been  described.  The  first,  an 
Imperfect  repeat  encoding  12  amino  acids,  is  also  present  in 
the  ST  LV- 1 1 1  genome  (Figure  7),  as  well  as  in  the  HIV-2j.q^ 

genome  at  positions  1605-1641  and  1667-1703,  suggesting  that 
this  sequence  duplication  must  have  occurred  long  ago  in  a 
common  ancestor  of  these  three  viruses.  The  second,  present 
in  HTLV-IIIB,  is  a  perfect  repeat  that  also  encodes  12  amino 
acids  (4),  and  is  absent  in  the  STLV-III^Qjk^  genome.  The  fact 
that  this  repeat  In  HTLV-IIIB  is  perfect  and  that  some  other 
HIV-1  isolates  as  well  as  STLV-I I  ly^Gj^  lack  it,  strongly  suggests 
that  the  duplication  in  the  HTLV-IIIB  genome  must  have  occurred 
relatively  recently.  Interestingly,  an  imperfect  direct 
repeat  of  18  amino  acids  can  be  detected  in  the  HIV-2j.qj 
genome,  indicating  that  the  border  between  the  gag  and  the 
po 1  open  reading  frame  may  be  particularly  prone  to  duplica¬ 
tions.  We  have  observed  similar  duplications  in  the  genome 
of  other  HIV-1  Isolates  (unpublished  results). 


The  second  large  open  reading  frame  in  the  STLV-III^Qf^ 
genome  spans  from  nucleotides  1714  to  4875,  has  a  coding 
potential  of  1053  amino  acids,  and  overlaps  the  gag  gene  by 
342  bases  (Figure  7).  We  assume  that  either  a  splicing  or  a 
frameshift  event  in  this  area  of  overlaps  results  in  the 
translation  of  the  po 1  gene.  By  analogy  with  HIV-1  and  other 
retroviruses,  this  open  reading  frame  should  contain  the 
genetic  information  for  protease,  reverse  transcriptase  and 
endonuclease  proteins.  In  HTLV-IIIB,  the  pol  region  encodes 
1003  amino  acids  and  also  extensively  overlaps  the  gag  reading 
frame.  The  amino  terminus  of  the  HTLV-IIIB  reverse  trans¬ 
criptase,  beginning  with  a  Pro-lie,  has  been  identified  by 
amino  acid  sequencing  of  the  purified  p66/p51  viral  enzyme 
(50).  A  perfect  alignment  can  be  found  between  the  amino 
termini  of  the  HTLV-IIIB  and  HIV-2,.q(J  reverse  transcriptase  and 
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the  inferred  amino  acid  sequence  of  the  STLV-III^qj^  RT  gene. 

Thus,  it  is  likely  that  the  Pro-Ile  in  position  2326-2331 
represents  the  first  two  amino  acids  of  the  STLV-1 1 1 
reverse  transcriptase.  The  overall  amino  acid  homology  of 
the  reverse  transcriptase  of  STL  V-1 1 1  versus  HTLV-IIIB  and 
HIV-2j.qj  is  53%  and  76%  respectively  (see  Table  4).  The  amino 
acid  sequence  upstream  from  the  beginning  of  the  reverse  trans¬ 
criptase  is  also  well  conserved  between  HTLV-IIIB  and  STLV- 1 II 
and  is  probably  the  region  which  encodes  for  the  protease  protein 
A  comparison  of  the  amino  acid  sequences  of  that  region  among 
HTLV-IIIB,  EIAV  and  visna  virus  leads  to  the  conclusion  that 
in  these  viruses  the  protease  is  encoded  within  the  same  open 
reading  frame  that  codes  for  the  reverse  transcriptase  gene. 

The  high  degree  of  relatedness  of  the  STLV-III^Cf/j  amino  acid 
sequence  in  this  region  with  that  of  HIV-2j.q^  (88%),  HTLV-IIIB 
(49%),  EIAV  (37%),  and  visna  virus  (30%)  allows  a  more  precise 
localization  of  the  presumed  cleavage  site  (Leu-Pro)  between 
protease  and  reverse  transcriptase  genes. 

Open  Reading  Frames  in  the  Middle  of  the  Genome 


Three  open  reading  frames  have  been  identified  in  the 
middle  portion  of  the  STLV-1 1  genome.  The  first  s  or ,  (al 

referred  to  as  Q),  overlaps  by  97  nucleotides  with  the  end  of 
the  po 1  gene  and  could  encode  for  a  225  amino  acid  protein. 

A  potential  methionine  initiation  codon  is  present  12  amino 
acid  downstream  from  the  beginning.  This  protein  shares  24% 
amino  acid  homology  with  the  s  or  gene  of  HTLV-IIIB  and  other 
HIV-1  strains.  Since  so r  appears  to  be  very  well  conserved 
among  the  various  HIV-1  isolates  and  since  STLV-III^q*^  and 
all  HIV-1  Isolates  so  far  analyzed  are,  in  general,  closely 
related,  the  percentage  of  homology  of  the  STLV-II  s  or 

protein  with  that  of  HTLV-IIIB  is  surprisingly  low.  However, 
the  amino  acid  hydropathy  profile  of  the  simian  and  human 
viral  proteins  is  very  similar  (data  not  shown).  A  higher 
homology  of  64%  can  be  found  between  the  s o r  (Q)  of  HIV-2j.qjJ 
and  STLV-III. 

The  second  short  open  reading  frame  overlaps  with  s o r  by 
171  bases  and  with  the  amino  terminus  of  the  e n v  gene  by  21 
bases  and  spans  from  position  5277  and  5723.  It  could  encode 
149  amino  acids  and  has  a  possible  intlation  codon  at 
position  2  (Figure  8).  The  first  60  amino  acids  share  high 
homology  with  the  X  open  reading  frame  described  in  the 
H1V-2j.qjJ  genome  (43).  The  last  part  of  the  sequence  shares 
considerable  homology  (40%)  with  the  second  exon  of  the 
transactivator  gene  ( t a t - 3 )  of  HTLV-III  which  includes  the 
functionally  Important  domains  of  the  t a t - 3  protein.  The 
amino  acids  conserved  between  the  STLV- 1 1 1  t  a  t  -  3  and  the 

HTLV-IIIB  t a t - 3  are  indicated  by  asterisks  in  the  lower  part 
of  Figure  8.  A  comparison  of  this  portion  of  the  STLV-1 1 1 


genome  with  the  published  HIV-2j.q(J  sequence  indicated  that 
the  STLV-IIl^Qf^  genome  obtained  from  the  K6W  cell  line  is 
lacking  350  nucleotides  in  the  middle  region.  The  deletion, 
as  it  is  schematically  represented  in  Figure  8  includes  part 
of  the  X  region,  most  of  the  R  gene  and  the  first  14  amino 
acids  at  the  amino  terminus  of  the  t a t -3  protein.  Hybridization 
of  an  oligomeric  probe  for  this  deleted  region  to  the  cellular 
DNA  from  which  these  clones  were  derived  indicated  that  these 
sequences  were  also  absent  in  the  majority  of  the  proviruses 
in  the  infected  cells.  This  suggests  that  STLV-III  may  have 
a  propensity  for  deletion  of  regions  from  the  central  area  of 
the  genome.  Since  we  have  not  shown  these  clones  to  be 
biologically  active,  we  cannot  say  whether  or  not  such  deletions 
would  result  in  the  loss  of  biological  activity. 

The  third  open  reading  frame  (5774-5987)  encodes  for  71 
amino  acids  and  has  a  potential  initiating  methionine  in  posi¬ 
tion  3  (Figure  8).  This  open  reading  frame  overlaps  completely 
with  the  coding  region  for  the  amino  terminus  of  the  large 
envelope  protein  gpl20. 


Conserved  domains  in  the  envelope  of  STLV -I I I «  HIV-1  and 
HIV-2 

The  largest  open  reading  frame  in  the  3'  region  of  the 
STLV-III^qj^  genome  corresponds  to  the  gene  encoding  the 
envelope  protein.  The  first  residue  is  at  position  5702  and 
the  probable  initiating  methionine  codon  is  located  10  amino 
acids  downstream.  The  env  gene  probably  extends  to  a  termi¬ 
nation  codon  at  base  8372-8374,  and  could  encode  a  peptide  of 
881  amino  acids.  There  is  also  a  termination  codon  at  base 
7934.  Based  on  protein  alignments  with  the  HTLV-IIIB  env 
gene,  this  termination  codon  is  located  in  the  t ransmerabrane 
portion  of  the  envelope  gene  immediately  prior  to  the  splice 
acceptor  site  in  the  third  exon  of  t  at  -  3  mRNA.  Interestingly, 
the  homology  between  STLV-III^qj^  and  HTLV-IIIB  decreases  con¬ 
siderably  after  the  termination  codon  at  position  7934  of  the 
STLV-III^Gt^  genome,  implying  that  these  sequences  may  not  be 
important  for  viral  replication.  We  have  observed  the  termi¬ 
nation  codon  at  7934  in  several  Independent  STLV-III^qj^ 
clones  and  the  same  termination  codon  is  present  in  a  bio¬ 
logically  active  HTLV-IV  clone  (38),  It  is  possible  that  its 
presence  has  biological  significance  in  the  pathogenesis  of 
STLV-III  infection  l_n  vivo.  Furthermore,  although  the  presence 
of  a  termination  codon  is  not  reported  in  the  published 
sequence  of  HIV-2j.q^j,  it  appears  to  be  present  also  in  some 
HIV-2j.qjJ  clones  in  the  same  position  (Alizon,  M.,  personal  com¬ 
munication).  Comparative  analysis  of  the  envelope  proteins 
of  STLV-III  and  HlV-2rod  indicated  that  70%  of  the 
amino  acids  are  identical  in  both  the  extracellular  and  the 
transmembrane  portion  of  the  protein.  Higher  amino  acid 
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homology  can  be  found  between  the  cleavage  site  and  the 
termination  codon  (82%)  than  at  the  carboxy  terminus  of  the 
envelope  open  reading  frame  (54%).  It  has  been  previously 
shown  that  deletion  of  part  of  the  carboxy  terminus  leads  to 
a  reduction  of  the  cytopathic  effect  In  vitro.  The  presence 
of  this  termination  codon  in  STLV- 1 1  ,  HTLV-IV  and  HIV-2, 

which  would  eliminate  40%  of  the  t ransroembrane  protein,  could 
modulate  the  pathogenicity  of  the  virus  through  variability 
in  the  degree  of  suppression  of  termination  In  vivo. 

An  alignment  of  the  inferred  amino  acid  sequences  of  the 
STLV-III^Qjkj  and  HTLV-IIIB  en v  gene  shows  38%  homology  overall 
(Figure  9),  and  permits  us  to  predict  that  the  signal  peptide 
extends  to  the  serine  at  base  5810-5812  and  that  the  large 
and  small  envelope  proteins  are  cleaved,  as  in  HTLV-IIIB, 
between  the  arginine  and  glycine  residues  at  bases  7307-7312. 
Based  on  this  assumption,  the  homologies  of  the  two  large  e n v 
proteins  and  transmembrane  are  35%  and  44%  respectively. 

The  hydropathy  profile  for  the  putative  STLV-IIIy^Cj^  trans¬ 
membrane  protein  is  almost  identical  to  that  of  HTLV-IIIB, 
consistent  with  functional  similarity.  Much  higher  homology 
can  be  found  between  the  envelope  proteins  of  STLV-III^qj^  and 
HIV-2j.od  (Table  5). 

There  are  several  remarkable  aspects  of  the  homology  of 
the  STLV-III/^Clkj  and  HIV-1  env  proteins  which  may  reflect  on 
the  relationship  of  their  structure  and  common  function  (s). 
First,  there  is  an  extremely  strong  conservation  of  cysteine 
residues.  Of  the  twenty-two  cysteines  present  in  the 
HTLV-IIIB  env  protein  (Figure  9)  21  are  present  in  the  same 
position  in  the  STL  V- 1 1  env  protein.  Different  strains  of 

HIV-1,  which  vary  by  up  to  20-25%  among  themselves  in  the 
amino  acid  sequences  of  the  env  protein,  similarly  show  an 
Invariant  conservation  of  these  cysteines.  This  suggests 
that  disulfide  bridges  play  a  critical  role  in  maintaining 
the  three-dimensional  structure  of  these  env  proteins. 
Similarly,  26  out  of  27  cysteines  are  conserved  between  the 
envelope  proteins  of  STLV-lII^Qji^  and  HIV-2goD  (figure  9). 

The  most  interesting  aspect  of  the  localized  homologies 
between  the  env  genes  of  STLV-III,  HIV-1,  and  HIV-2j.qjJ  is 
that  they  occur  in  those  regions  which  are  most  conserved 
among  different  strains  of  HIV-1.  Conversely,  the  e nv  genes 
of  HTLV-IIIB  and  STLV-III^qj^  are  roost  divergent  in  the  regions 
of  the  HlV-1  env  which  are  most  hy pe r va r i ab le  .  This  suggests 
that  the  conserved  regions  are  Important  for  functions  which 
are  common  to  the  functions  of  the  env  proteins  of  these 
retroviruses.  Since  STLV-III^GM  HIV-1  and  HIV-2^od» 

seem  to  use  the  CD4  protein  as  their  receptor,  this  protein 
may  bind  to  highly  conserved  regions.  In  fact,  it  has  recent¬ 
ly  been  shown  by  Lasky  et  al.  (personal  communication)  that 
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the  region  containing  the  last  cysteine  in  HIV-1  gpl20  is 
critical  for  binding  to  the  CD4  molecule.  This  region  spans 
from  amino  acid  442  to  463  in  the  STLV-III^qj^  envelope  and 
has  an  amino  acid  sequence  almost  identical  in  HTLV-IIIB, 

STL  V- 1 1  ,  and  HIV-2j.qj  (Boxed  region  in  Figure  9). 

Another  Important  antigenic  site  has  been  mapped  in  this 
highly  conserved  region.  Cease  et  al.  (51)  have  identified 
two  peptides,  Ti  and  T2,  which  elicit  T  cell  immunity.  The 
16  amino  acids  Tl  peptide  maps  within  the  putative  envelope 
region  binding  site  of  the  CD4  molecule. 

The  open  reading  frame  homologous  to  the  Trs  gene 

In  the  region  corresponding  to  the  t rans membrane  portion 
of  the  putative  STLV-III  en v  gene  an  open  reading  frame 
encoding  96  amino  acids  can  be  found  between  nucleotides  7900 
and  8187.  This  region  follows  immediately  after  a  splice 
junction  present  in  a  functional  STLV-III^qj^  tat  -3  cDNA 
( Colomblni-Hat ch  et  al.,  personal  communication).  The  probable 
protein  encoded  by  this  region  shares  29%  homology  with  an 
equivalent  open  reading  frame  in  the  HTLV-IIIB  genome.  In 
HIV-1,  the  existence  of  this  gene  was  discovered  by  l_n  vitro 
mutagenesis  of  a  biologically  active  HIV-1  clone  and  designed 
art  or  t  rs  (  52  ,  5  3  ).  The  protein  product  of  the  HIV-1  trs  of 
19  Kd ,  appears  to  be  essential  for  the  synthesis  of  gpl20 
(Knight  et  al.,  1987). 

The  3 ' or f  Gene 


The  last  large  open  reading  frame  is  located  at  the  3'  region 
of  the  viral  genome  and  overlaps  with  the  carboxy  terminus  of 
the  envelope  gene  and  the  U3  region  of  the  viral  LTR.  This 
region  (8196-8828)  has  a  coding  potential  for  211  amino  acids 
and  a  potential  initiating  AUG  codon  at  position  8208.  A 
similar  region  has  been  described  in  the  HTLV-IIIB  genome 
which  has  been  shown  to  encode  a  27K  protein  (54,55).  The 
overall  amino  acid  homology  between  this  putative  STLV-III^qj^ 
protein  and  the  HTLV-IIIB  3 '  or  f  is  34%.  A  more  detailed 
analysis  of  the  amino  acid  composition  of  the  3'  orf  proteins 
among  the  STLV-IIIy^QH^  and  the  American,  French  and  African 
HIV-1  isolates  and  HIV-2j.od  reveals  the  existence  of  a  highly 
conserved  region  of  70%  homology  spanning  about  80  amino 
acids  (Figure  10).  The  hydropathy  profile  of  this  region  in 
the  different  viruses  is  almost  indistinguishable.  It  is 
likely  that  these  80  amino  acids  represent  the  core  of  the 
functional  domain  of  the  3'  orf  protein. 

b.  Genetic  Analysis  of  HIV-2nth-z 


The  HIV-2jjjf^_2  virus  Isolate  was  obtained  by  D.  Zaguri 
from  a  patient  with  immunodeficiency  who  originally  lived  in 


Guinea  Bissau.  A  genomic  library  was  constructed  from  the 
DNA  of  the  Infected  human  neoplastic  T-cell  line 

HUT  78  (  56  ).  Using  the  STLV-I I  provlral  DNA  as  a  probe 

(39),  we  purified  a  molecular  clone  of  the  complete  provirus 
and  determined  its  nucleotide  sequence.  The  genome  of  HIV-2fjj|^_2 
is  9A31  base  pairs  long  and  the  overall  organization  of  the 
open  reading  frames  is  consistent  with  the  order  5'  LTR-gag- 
pol-middle  r e gi on-e nv-3 ' o r f -3 ' LTR.  Similar  structures  have 
been  reported  for  HIV-1  (4,57,58,3).  A  single  structural 
difference  lies  in  the  presence  in  HIV-2ig^f^_2  extra 

open  reading  frame  in  the  middle  of  the  genome  which  is  absent 
in  the  HIV-1  genome  but  which  is  also  present  In  HIV-2j.q(3  and 
STLV-IIIagm.  called  X  (43,41). 

The  LTR 

The  HIV-2;\jIH_2  632  nucleotides  long.  The  size  of  the 
individual  components  was  derived  by  analogy  with  HIV-2j.gj 
and  STLV- 1 1 1  .  The  U3,  R,  and  U5  are  329,  1  76  and  127  base 

pairs  in  size,  respectively.  While  the  U5  and  R  are  comparable 
in  size  to  the  HIV-2j.qjJ  and  ST  LV- 1 1 1  ,  the  U3  of  the 

HlV-2[g];^_2  is  shorter  than  that  of  either  virus.  A  sequence 
alignment  of  the  U3  region  of  the  HIV-2^IH_2  showed  a  deletion 
of  22  8  nucleotides  in  the  H  IV-2(ij];}^_2  U3  region.  The  deletion 
occurred  60  nucleotides  3'  of  the  polypurine  tract  (Figure  11). 
Since  the  deletion  Is  present  in  both  the  5'  and  3'  LTRs,  it 
is  probably  present  in  the  provirus  and  is  not  a  cloning 
artifact.  The  position  of  the  regulatory  sequences  in  the 
H  lV-2  LTR,  such  as  the  TATAA  box,  the  po  1  y  ade  n  y  1  a  1 1  on 

signal  AATAAA,  and  the  SP 1  binding  sites,  are  indicated  in 
Figure  11. 


The  gag  and  pol  gene  proteins:  relationship  to  other  lenti- 
vi r us  es 

The  first  open  reading  frame  derived  from  the  nucleotide 
sequence  of  the  HIV-2j^jf^_2  corresponds  to  the  gag  precursor. 

In  the  infected  cells  the  size  of  the  gag  precursor  appears 
to  be  55  Kd  as  judged  by  immunoprecip itat ion  with  human  sera 
from  Infected  individuals  (35).  The  gag  open  reading  frame 
has  a  coding  capacity  of  519  amino  acids.  Although  definitive 
evidence  has  not  been  provided  yet,  the  gag  precursor  should 
exhibit  the  sam.e  size  as  in  HIV-1,  p55  (50).  The  amino  acid 
homologies  of  HIV-2|\jIH-Z  gs g  precursor  polypeptides  when 
compared  with  HIV-2f.Qjj,  STLV-1 1 1  and  the  HTLV-IIIB  strain 

of  HIV-1  are  92%,  82%,  and  52%  respectively  (Table  5).  This 

indicates  that  the  two  human  HlV-2s  are  somewhat  more  closely 
related  to  each  other  than  to  STL  V- 1 1 1  and  that  HIV-1  is 

much  more  distantly  related  to  all  of  them.  The  cleavage  site  for 
the  major  core  protein  (p24)  In  the  HIV-2t^IH-Z  gag  precursor 
can  be  provisionally  assigned  by  alignment  of  the  amino  acid 
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sequences  of  the  gag  precursor 
and  ST  LV  -  1 1 1  (Figure  12). 

These  alignments  were  performed  using  the  algorithm  of 
Dayhoff  and  colleagues.  These  sequences  were  aligned  with 
those  of  the  major  gag  proteins  of  Vlsna  and  EIAV  in  order  to 
establish  their  respective  phylogenetic  relationship 
(Figure  12).  The  results  expressed  as  percentage  of  amino 
acid  sequence  are  summarized  in  Table  6.  The  analysis  of  the 
p24-26  proteins  indicates  that  HIV-2;q];jj_;7  and  HIV-2j.q(^  are 
highly  related  (96%)  and  are  equally  distant  from  STLV- 1  1 1 
(88%).  Both  HIV-2s  and  STLV-III^qvj  appear  to  be  equally 
distant  from  HIV-1  (66-68%).  Equal  homology  (28-29%)  is 
found  between  EIAV  and  HIV-1,  both  HIV-2  Isolates,  and 
STLV-III^q^.  Similarly,  Visna  virus  shares  only  24-26% 
homology  with  HIV-1,  HIV-2  and  STLV-I I  ly^^vf .  Thus,  the  com¬ 
parison  suggests  an  ancestral  relationship  between  the  ungulate 
retroviruses  and  the  primate  immunodeficiency  viruses,  but 
also  indicates  that  the  divergence  of  the  primate  immuno¬ 
deficiency  viruses  occurred  much  later  while  HIV-2  and 
STLV- 1 1 1  diverged  later  still. 

The  p o 1  open  reading  frame  of  1190  amino  acids  (Figure  11) 
overlaps  the  gag  precursor  open  reading  frame  as  in  HIV-1. 

The  overall  amino  acid  sequence  relationships  among  the  p o  1 
genes  of  both  HIV-2  Isolates,  HIV-1  and  STLV-III  are  like 
those  of  the  gag  genes  and  lead  to  the  same  conclusions. 

Conserved  and  variable  domains  in  the  envelope  of  HIV-1, 

HIV-2  and  STLV-IIlArM 

The  third  major  open  reading  frame  in  the  HIV-2fjjn_2 
provlrus  corresponds  to  the  envelope  protein  (Figure  12)  and 
potentially  encodes  for  856  amino  acids.  A  comparative 
analysis  of  the  envelope  protein  of  HIV-2fjm_2  ^^'^“^rod 

and  STLV-IIl^Cj.(  showed  an  overall  homology  of  80%  and  70% 
respectively.  The  degree  of  conservation  Is  comparable  In 
the  extracellular  and  transmembrane  envelope  proteins 
(Table  5).  The  envelope  proteins  of  HIV-2\jh-Z  HIV-I  are 

much  less  related  (35%),  as  shown  in  Table  2.  A  homology 
comparison  between  HIV-1  and  HIV-2  was  undertaken  In  order  to 
identify  conserved  regions  which  may  be  crucial  for  the 
function  of  the  envelope  protein.  The  position  of  the 
cysteines  is  highly  conserved  among  all  these  retroviruses. 

In  the  extracellular  envelope  protein  22  cysteine  residues 
are  conserved  among  both  HIV -2  isolates  and  STLV-IIT^(;jj/(  and 
19  of  22  are  also  conserved  in  the  same  position  In  all 
strains  reported  of  HIV-1  (Figure  13). 

Similarly  In  the  t ra ns memb r a ne  portion  of  the  envelope 
protein  three  cysteines  are  conserved  among  both  strains  of 
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HIV-2  and  STLV-I I  and  two  are  also  conserved  in  the 

HTLV-IIIB  strain  of  HIV-1.  Clearly,  disulfide  bonds  must 
play  a  crucial  role  in  maintaining  the  secondary  structure 
of  the  envelope  proteins. 

Further  analyses  of  the  amino  acids  homology  among  the 
envelope  proteins,  identified  regions  in  which  either  complete 
amino  acid  identity  or  only  conservative  changes  could  be 
detected  in  all  these  viruses.  Of  these  envelope  conserved 
regions  (ECR)  Indicated  in  Figure  13,  the  ECR-6  which  is 
located  in  the  extracellular  glycoprotein  has  been  identified 
as  a  putative  binding  site  to  the  CD4  molecule  which  is  an 
essential  part  of  the  cellular  receptor  for  HIV-1,  STLV-I I 
(36,37,38)  and  most  likely  HIV-2.  A  peptide  (T-1)  capable 
of  inducing  cell  mediated  immunity  in  mice  has  been  syn¬ 
thesized  using  the  sequence  of  the  ECR-6  region  from  HIV-1 
(51).  Regions  ECR-7  and  ECR-8  which  are  located  at  the  amino 
terminus  of  the  transmembrane  envelope  protein  may  represent 
regions  Involved  in  the  repetitive  folding  of  the  protein 
within  the  cellular  membrane,  as  proposed  by  computer  assisted 
analyses  of  the  envelope  proteins  of  different  strains  of 
HIV-1  (59).  Based  on  the  homology  of  the  ECR7  region  with 
the  fusion  peptide  of  human  paramyxoviruses,  measles,  and 
respiratory  syncytial  viruses  (60),  it  has  been  hypothe¬ 
sized  that  the  first  11  amino  acids  conserved  in  ECR7  may  be 
the  fusion  peptide  of  HIV-2,  HIV-1  and  SILV-III^GM*  Finally, 
the  3'  most  conserved  region  in  the  transmembrane  envelope 
protein  (ECR-12)  has  been  implicated  to  be  involved  in  the 
cytopathlc  effect  i_n  vitro  (61)  even  though  others  (52)  have 
reported  contrary  results.  Despite  these  differences,  which 
may  be  related  to  the  use  of  different  target  cells,  the 
conservation  of  these  17  amino  acids  in  HIV-1,  HIV-2  and 
STLV-III^GM  biologically  significant. 

In  HIV-2fjjH_2  infected  cells  a  protein  of  33  Kd 
(compared  to  41  Kd  in  HIV-1  infected  cells)  believed  to 
be  the  transmembrane  envelope  protein  has  been  identified 
(unpublished  data).  Similarly,  a  truncated  form  of  the 
t  ransraembrane  protein  has  been  identified  in  STLV-IIIy^GM 
Infected  cells  (31).  The  en  v  gene  of  STLV- 1 1 1  contained  a 
termination  codon  that  would  eliminate  the  last  146  amino  acids 
at  the  carboxy  terminus  of  the  t rans membrane  portion  of  the 
envelope  (41,42).  HIV-2{^jj^_2  does  not  have  a  termination  codon 

at  the  same  position.  Since  the  Infected  cells  from  which  it 
was  Isolated  appear  to  express  a  truncated  gp33,  the  provirus 
obtained  in  our  laboratory  is  not  representative  of  the 
majority  of  the  provirus  present.  Although  the  biological 
significance  of  the  stop  codon  in  some  of  these  viruses  is 
not  clear,  it  might  be  Important  in  elucidating  their  biological 
properties.  This  stop  codon  Is  present  In  the  sane  position 
in  some  clones  of  HIV-2j.q^  (62).  Furthermore,  a  comparison  of 
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the  amino  acid  sequence  of  the  t rans membrane  envelope  protein 
of  these  viruses,  shows  a  decrease  in  homology  after  the 
stop  codon  (Figure  13  and  14).  These  data  suggest  that  this 
stop  codon  does  have  biological  significance. 


The  same  regions  that  have  been  found  to  vary  most  in 
HIV-1  (63)  are  generally  the  most  variable  in  both  HIV-2 
isolates  and  STLV-III^q»^  (Figure  13).  The  first  region  of 


variability  among  the  envelope  proteins  of 


NIH-Z . 


and  ST  LV- 1 1 1  spans  from  amino  acid  112  to  190  in  HIV-2{^'[^_2 

and  corresponds  to  the  hy pe r var i ab 1 e  region  from  aal30  to  aa210 
in  HIV-1  (63,64).  The  degree  of  variability  in  this  region 
of  the  West  African  retroviruses,  STLV-III^Qji^,  and  HIV-1  isolates 
were  analyzed.  The  percentage  of  amino  acid  identity  ranged 
from  30-60%.  This  equivalence  suggests  that  both  groups  of 
viruses  may  have  spread  to  their  present  ranges  from  a  limited 
focus  of  infection  at  approximately  the  same  time. 


Identification  of  putative  functional  domains  of  other  viral 


proteins . 


Several  other  genes  have  been  identified  in  the  HIV-1 
genome  by  immunological  (54,55,65,66)  or  functional  studies 
(6,52,53,67).  The  corresponding  genes  can  be  identified  in 
the  HIV-2  genome  and  the  comparative  analyses  of  their  amino 
acid  sequence  indicate  strongly  conserved  domains  within  some 
of  them. 


The  HIV-2  t  rs  /a  rt  gene,  which  was  discovered  in  HIV-1  by 
mutagenesis  of  a  biologically  active  HIV-1  clone  (52,53), 
seems  to  be  crucial  for  the  expression  of  the  HIV-1  envelope 
protein.  Protein  sequence  alignments  of  the  HIV-2  and  HIV-1 
t  r s / a  r  t  gene  products  show  an  arginine  rich  region  in  the 
second  coding  exon  that  is  conserved  among  HIV-1,  HIV-2  and 
STLV-III^q^  (Figure  15).  Similarly,  arginine  and  cysteine 
rich  regions  (Figure  15)  can  be  Identified  in  the  first 
coding  exon  of  the  tat  proteins,  which  are  responsible  for 
the  t r ans ac t 1 va 1 1  on  of  virus  expression  in  these  viral  isolates 
(40,68).  No  recognizable  conserved  regions  were  detected 
within  the  s  or  (Q)  gene  although  the  sor  proteins  of  these 
viruses  share  a  similar  hydropathy  profile  (not  shown).  A 
highly  conserved  region  could,  however,  be  Identified  in  the 
3  '  or f  (F)  gene  although  a  correct  protein  alignment  of  the 
3  ' o r f  protein  product  can  not  be  obtained  because  of  the 
presence  of  the  228  nucleotide  deletion  in  the  U3. 


6.  Conclu'^  ^  on 


The  identification  of  an  increasing  number  of  human  and 
simian  retroviruses  in  the  last  five  years,  make  it  Imperative 
to  determine  their  precise  genetic  relationships  in  order  to 


elucidate  the  genetic  basis  for  their  pathogenic  effects. 
Studies  on  the  replicative  functions  of  HlV-1  and  its  role 
in  T-cell  killing  have  shown  that  these  human  retroviruses 
have  a  more  complicated  mechanism  of  regulation  than  the 
non-primate  retroviruses.  The  discovery  of  a  second  group  of 
viruses  in  both  primate  and  human  which  are  structurally  and 
genetically  related  to  HIV-1  and  are  also  associated  with 
immunodeficiency  calls  for  a  reinterpretation  of  the  natural 
history  of  these  viruses  and  for  a  reevaluation  of  the  hypothe¬ 
sis  that  AIDS  is  a  new  disease.  In  fact,  analyses  of  the 
rate  of  nucleotide  changes  suggests  that  HIV-1  and  HIV-2 
might  have  diverged  from  each  other  as  recently  as  40  years 
ago  (G.  Meyers,  Los  Alamos;  personal  communication)  while  the 
first  documented  cases  of  AIDS  or  aggressive  form  of  Kaposi 
in  young  people  date  as  far  back  as  the  early  1960s  (69). 

The  molecular  characteristics  of  STLV-III  and  HIV-2|giH_2 
reveal  that  their  genetic  organization  is  very  similar  to 
HIV-1.  STLV-II  I^Qjvj  and  HlV-2  are  more  closely  related  to 
each  other  than  to  HIV-1.  Nevertheless,  the  extent  of 
similarity  among  these  primate  viruses  indicate  that  they 
arose  from  a  common  ancestor.  The  Immune  cross  reactivity 
of  ST  LV- 1 1  ,  HIV-1  and  H1V-2j.qjJ  in  the  major  core  proteins 

reflects  the  high  conservation  of  the  gag  gene  sequence. 
Comparison  of  the  gag  protein  of  STL  V-I I  I^qj^  ,  HIV-1  and 
HIV-2  with  other  members  of  the  lentiviridae  family 
demonstrate  that  the  former  represents  a  group  of  viruses 
that  must  have  diverged  from  each  other  more  recently 
than  they  diverged  from  the  latter  group  and  that  STLV-II  I^q),j 
and  HIV-2  diverged  from  each  other  more  recently  than 
either  diverged  from  HIV-1. 

The  s o r  gene  of  STLV-III^qj^  as  well  as  of  HIV-2  appears 
to  be  less  conserved  than  all  the  other  genes.  The 
biological  significance  of  this  observation  is  not  clear 
since  the  function  of  the  23  Kd  HIV-1  so  r  protein  (70) 

1 s  unknown . 

Conversely,  the  high  degree  of  conservation  of  the  central 
region  of  the  3  '  o r  f  protein  between  these  three  viruses 
suggests  that  this  region  may  be  the  functional  domain  of  the 
protein.  The  27  Kd  HIV-1  3  * o r f  protein  is  a  cytoplasmic 
protein  (55)  apparently  not  required  for  viral  replication  i n 
vitro  (61).  The  conservation  of  the  amino  acid  sequence 
of  the  3 ' orf  proteins  would  suggest  its  importance  in  the 
biology  of  these  primate  retroviruses,  perhaps  as  a  negative 
regulatory  element. 


5  One  point  of  considerable  debate  is  whether  the  West 
African  viruses  (HIV-2)  cause  AIDS  in  people.  Early  reports, 
wh  ich  identified  a  human  virus  related  to  STLV-III^qji/j,  on  the 
basis  of  serology,  suggested  a  lack  of  disease  association 
(32)  while  others  reported  the  isolation  of  HIV-2  viruses 
from  a  few  patients  with  immunodeficiency  and  no  signs  of 
infection  with  HIV-1  (33).  More  recently,  an  increasing 
number  of  HIV-2  Isolates  have  been  obtained  from  patients 
with  AIDS  from  West  Africa  (68,  35).  However,  a  retrospective 

seroepidemlologi cal  study  on  4248  people  in  West  Africa 
showed  the  absence  of  any  clinical  signs  in  330  infected 
people  (71).  The  changes  In  the  envelope  proteins  of  two 
HlV-2  Isolates  are  similar  which  suggests  that  HIV-1  and 
HIV-2  have  existed  in  their  present  population  for  similar 
lengths  of  time.  Therefore,  It  is  possible  that  the  dis¬ 
crepancy  between  these  studies  is  due  to  a  lower  morbidity 
rate  for  HIV-2.  A  more  difficult  question  is  whether  there  is 
a  fundamental  genetic  difference  between  the  two  virus  groups 
that  could  explain  their  apparent  different  biological  behavior 
in  infected  individuals.  The  overall  genetic  structure  of 
HIV-1  and  HIV-2  is  very  similar,  with  the  exception  of  an 
extra  open  reading  frame  in  HIV-2,  which  has  been  designated 
X.  No  evidence  is  currently  available  which  would  indicate 
whether  or  not  X  is  translated. 

The  presence  of  a  stop  codon  in  the  t rans membrane  portion 
of  the  env  gene  of  some  of  these  viruses  opens  the  possibility 
that  two  different  forms  of  the  envelope  small  protein  could 
be  synthesized.  The  biological  significance  of  this  phenomena 
in  vitro  and  vivo  remains  to  be  investigated. 
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Construction  and  properties  of  sor  mutants  of 
HIV-1.  Plasmid  X  was  generated  from  pHXB2gpt  by 
removal  of  an  EcoRl  site  in  the  polylinker  region. 
Mutant  S  was  prepared  by  digestion  with  Ndel  and 
Ncol  and  religation.  Mutants  6.9,  3.3,  and  153.0 

were  generated  by  site  directed  mutagensis  such 
that  stop  codons  were  introduced  in  frame  at  the 

locations  marked  - X -  (see  text).  Mutated 

fragments  were  recloned  into  clone  X  and  trans¬ 
fected  into  H9  cells  by  protoplast  fusion.  Cul¬ 
tures  were  monitored  for  HIV  1  production  throughout 
the  course  of  the  experiment. 
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Construction  of  packaping  defective  mutants. 

A  series  of  deletion  mutations  in  the  region 
between  the  5'  LTR  and  gag  were  constructed 
from  pHXB2gpt.  The  clone  was  cut  with  either 
Sad  or  BssH2,  subjected  to  Bal3l,  blunt  ended 
with  DNA  polymerase,  and  religated.  Mutants 
X 10-1-25  and  #293  were  obtained  by  site  direc¬ 
ted  mutagenesis. 
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Plasmid  clone  C15CAT  was  constructed  by  ligation 
of  Hindlll  linkers  to  the  blunt  ended  PstI  insert 
of  C15,  digestion  with  Hindlll  and  ligation  of  the 
resulting  fragment  into  the  Hindlll  site  of  pSVOCAT. 
Deletion  mutants  in  LTR  were  made  by  Bal31  exonuclea 
digestion  from  the  Kpnl  site  in  the  direction  of 
arrows. 
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Features  of  clone  C15  and  resultinff  5’  deletion 
clones  containing  HTLV-III-LTR  sequences.  CD12 
contained  a  small  deletion  around  the  Kpnl  site 
by  restriction  enzyme  analysis  while  retaining 
the  surrounding  Xhol  and  Bglll  sites.  Each 
clone  is  numbered  corresponding  to  the  distance 
in  nucleotides  from  the  transcriptional  start 
site  at  4-1.  Regions  of  homology  to  other  genes 
and  biologically  relevant  features  are  diagrammed 
as  indicated  above  the  map.  CAT  assay  results 
of  lysates  prepared  from  transfected  H9  and  H9/1II 


cells  are  shown  on  the  right  of  each  clone.  Values 
are  percent  conversion  of  ch 1 o r a mphe n i c a  1  to 

acetylated  metabolites. 
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R  PGOR  KKYMLKHVVWAAHEL 

640  720 

AGATACATTrOGATTAGCAGAAAOCCTCTTGCAGAACAAAGAAOGATGTCAAAAAATACT 
GBrciACSLLEMKECCOKlL 

7S0  780 

TTCGGTCTTAGCTCCATTAOTGCCAACAGGCTCAGAXAATTTAAAAACCCTTTATAATAC 
SVLAPLVPTCSEHLKSLYHT 

810  840 

rGTCTGCGTCATCTGGTCCATTCAGCCAOAAGACAAACTCAAACACACTCAGCAAGCAAA 
VCVIWCIHAEEKVRHTEEAK 

87C  900 

ACAGATAGTOCAGAGACACCTAGTGOTOGAAACAGCAACAGCAGAAACTATGCCAAAAAC 
gi  VQPMLVVETCTAETHPKT 

024 

9)0  960 

AJkGTAGAGGAACAGCAGCATCTAGCCGCAGAGGAGGAAATTACCCAGTACAACAAATACG 

spptapsscrcgkypvqqic 

990  102C 

TOGTAACTATGTCCACCTGCCATTAAGCCCGACAACATTAAATCCCTGoGTAAAATTGA? 
GHYVHUPtSPRTLHAHVKLl 

los;  1C8C 

AGAGGAAAAGAAArrTGGAGv'AGAAGTAGT'3.rGAGGArTTCAGvCACr3rCAGAAGGCT5 
EEKR  FGA£VVPorC*LSEGC 

in:  iuc 

GACCCrCTATGACATTAAT'TAGATGrrAXArrGTGTAGGAGACCATCAAOCGGCTATO'A 
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OATTAT 'AGAGATArTATAAATOAGCAGGrTGTAGATTGOGACTTGCAcCACCrACAACC 
::PG::NCEAAGwoLvHr»p 

;:k  ;:6', 
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»  ;  .  LA'-,:.;  n 


AAAGAJkGCCCCCCAATTTCCCCATGGCTCAACTGCATCAOOOGCTCACCCCAACTOCTCC 
K  K  PRNfPKAOVHQCLTPTAP 
KEAPOFPHGSSASCAOANCS 
19S0  1980 

L'CCAGAGGACCCAGCTGTGGATCTGCrAAAOAACTACATCCACTrOGGCAAGCAGCACAG 
p£OPAVDLLKNY«0LCFg0» 
PPGPSCCSAKELHAVCgAAC 
2010  2040 

AGAAA&CAGACAOAAGCCTTACAAGCAGCTGACACAOCArrTGCTOCACCTCAATTCTCT 
ESREKPYAEVTEOLLHLNSL 
RKOREALgCCDPCFAApgrS 
QAQ— H  2070  2100 

rTTTGCAGCACACCAGTACTCACTGCTCATATTCAAOGACAGCCTCTAGAACTATrATTC 

r  G  G  D  g 

LWRRpVVTAH  I  ECgPVEVLL 

21)0  2160 

gatacaggccctcatgattctattgtaacaooaatagagttaggtccacattatacccca 
DTGADDSIVTGIELGPHYTP 

2190  222C 

AAAATACTAGGAGCAATAGGAGCrnTATTAATACTAAAGAATACAAAAATGTAGAAATA 
IcrVGCrGGFIMTKCYKKVCI 

22S0  2280 

GAACTTTTACCCAAAAGCATTAAAACGACAATCATGACACGGCACACTCCGArrAACArr 
EVLGRRlRRTINTGOTPIMt 

h— PET 

2)10  2)40 

tttcgtaggaatttgctju^cagctctexxwatgtctctaaatcttcccatagctaaqcta 

FCRNLLTALCNSLMLPIAKV 

2)70  2400 

GACCCTGTAAAAGTCCCCTTAAACCCAGGAAAaCTTGCACCAAAATTGAACCAaTGGCCA 

EPVKVALKPGRVGPRLRgwP 

24)0  2460 

TTATCAJLAAGAAAAGATACTTGCATTAAGAGAAATCTGTCAAAAOATGGAAAAGGATOGT 
LSKEKIVALREICEKHEKDC 

2490  2S20 

CAGTTGCAGGAAGCTCCCCCGACCAATCCATACAACACCCCCACATTTOCCATaAAGAAA 
gLEEAPPTHPYNTP  yrAIKK 

2SS0  2S80 

aaagataagaacaaatocacaatgctgatacattttacgcaactaaataooqtcactcag 
KDRNFWRnLIHPRELNRVTO 

2610  2440 

gaactttacacaagtccaattagaataccacaccctccagcactagcaaaaaooaaaaoc 
ELYRSPIRZPHPACLAERRR 

2670  2700 

ATT  AC  A  GTACTGGATATAGCTGATGCXTATTrCTCCATACCTCTAOATGAACAATrrAOG 
ITVLDICDAYFSIPLDEEr* 

27)0  2760 

CACTACACTCCCTTTACTTTACCATCACTAAATAATCCAGAOCCAOGAAAACGATACATT 
OYrAFTl.PSVHKAEPGKRYI 

2790  2820 

TATAyAGGrrCTCCCTCAGGOATGCAAGGOGTCACCACCCATCTTCCAATACACTATOAGA 
YFVLPgCWKGSPAirgYTNR 
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CATGTOCTACAACCCTTCAGOAACCCAAATCCAGATCTGACCTTACTCCAffTATATOGAT 
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GACArCTTAATACCTACTCACAGCACAGACCTCCAACATCACAGOCTACnTrACAOCTA 
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»  .  LTTTNG^XELEAIYHCl  E 

*8i0  1840 

,*  'T-'AGGGCCXAAGAGAAATATTATXGTXGXATTACAAGTATCTTATCGOAATXArXAC 
r  S  C-PKRNI  JVELOVCYGNNN 

1870  1900 

AGOTTrrrrACAaAATCAGAGAGCAGGCTAGTTAACCAAATAATACAACAAATGATTAAA 
BFPTESESBLVNQl  ICEMIK 

19)0  1960 

jtcagogtttatgtagcatgggtaccagcattacaaogtataggagcaaaccaacaaata 

VRVYVAWVPALEGICGHOCI 

1990  4020 

GGTCCACTAGTTAGTCAGGGGTTTAGACAAGTTCTCTTCTTGCAAAAGATAGAGCCAGCA 
GPLVSOGFBOVLFLEKIEPA 

40S0  4080 

CAAGAAGAACaTGATAAATACCATAGTAATGTAAAAGAATTCCTATTCAAATTTGCATTA 
CEEHDRYHSNVUELVFRFGL 

4110  4140 

cccacaatactggccagacaoatagtagacacctgtcataaatgtcatcagaaaggagaa 

PSIVAFOIVDTCOAChCKCE 

4170  4200 

gctatacatgggcaggtaaattcagatctaoogacttggcaaatcgactctacccatcta 

AI  HGOVNSDLGTWOMdCTHL 

42)0  4240 

caagcaaaaatagtcatagttgcagtacatctagctagtocattcatagaaccacaagta 

EGKlVIVAVHVASGFlEAEV 

4290  412C 

ATTCCACAAGAGACAGGAAQACAGCACTATTTCCTGTTAAAATTCGCAGGCAGATCCCCT 
1  POETGROHYFLLKLaGFWP 

4)SC  4)80 

tatttacacatctacacacacagtaatggtgctaactttgcatcgcaagaagtaaacatc 

YL.HIYTHSNGAHFASOEVKH 

4410  4440 

GTTACATGGTGGGCAGGGATAGAGGCACACCTrCCGGTACCATACAATCCACAGAGTCAC 
'.  TWWAGIEAHLWVPYNPOSO 

4470  4^00 

ggagtagtggaagcaatgaatcaccacctgaaaaatcaaatagatagaatcaggcaacaa 

GVVEAnNHHLKNQIORtREO 

4930  4960 

GCAAATTCAGTAGAAACCATAGTATTAATGGCAGTTCATTGCATGAATTTTAAAAGAAGC 
ANSVETIVLHAVHCNNFKDB 

pcWvpunn* 

4990  4620 

GGAGGAATAGGGGATATGACTCCACCAGXAAGATTAATTAACATGATCACTACAGAACAA 
GGIGOMTPAERLIHMITTEO 
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GGTTTOCTArCTGCCCCArrTTAAGOTCOGATGGGCATGOTOGACCTCCAGCAGACTAAT 
.•  .•yvpHFRVGWAWWTCSRVI 
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•rrrC'CCrACAGOAAGCAAGCCATTTAGAAGTACAACOGTATrOOCATTTGACACCAGA 
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AAOAGOGTOGCCCAGTArrTATCCAGTOAOGATAACCTGGTACTCAAGCCACCTTTTGGA 

PGwrr.  t  vavritwysrollo 
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•A  ,ArGTAA''ACCAGArTArGCAGGCAT7rrrcCTGCATAGCACTTATTTCCCTCTrrAC 
K’NTPI.  -‘PHFSCIALISLFT 
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ATACATCTTCTGCATCAAACAAGTAAGTATGGCATGTCTTOOGAATCAC.CTGCTTATCGC 
YI  FCl  KOVSMGCLGNgLL  :  A 

979.  is:: 

CATCTTGCTTTTAAGTGTCTATGGGAT CTATTGTACTCAATATCTCACAGTCTTTTATGC 
1  LLLSVYC!  YCTOYVTVF  VC 

9890  saac 

TGTACCAGCTTGGAGGAATCCGACAATTCCCCTCTTCTCTGCAACCAAGAATAGGGATAC 

VPAWRMATIPLFCATRNRDT 

9910  9940 

TTGCGCAACAACTCACTGCCTACCAGATAATGGTCATTATTCACAATTGGCCCTTAATGT 
WGTT'JCLPONGDYSELALNV 

9970  6000 

TACAGAAACCTTTCATGCTTGCGACAATACACTCACAGAACAGCCAATAGAGGACGTATG 
TESFDAWENTVTEpAIEDVW 

6010  6060 

OCAACTCTTTCACACCTCAATAAACCCTTCTGTAAAArrATCCCCATTATCCATTACTAT 
gLFETSIKPCVICLSPOClTM 

6090  6120 

GAGATCCAATAAAAGTCAGACAGATAGATGGGGATTGACAAAATCATCAACAACAATAAC 
RCKKSETDRWGLTPCSSTTIT 

6190  6180 

AACACCACCACCAACATCAGCACCAGTATCACAAAAAATAGACATCGTCAATGAGACTAG 
TAAPTSAPVSERILiM  '.  NETS 

6210  6240 

TTCTTCTATACCTCACAATAATTCCACACCCTTCGAACAACACCAAATGATAAGCTGTAA 
SCIAONMCTCLEOEOMISCI' 

6270  6)0C 

ATTCACCATGACAOGGTTAAAAACAGACAAGACAAAGGAGTACAATGAAArTTCGTACTC 

FTMTGLKRDKTKEYNETWYS 
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TACACATTTGGTTTCTCAACAAOGGAATAGCACTGATAATCAAACCAGATGCTACATGAA 
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TCACTGTAACACTTCtGTTATCCAAGAGTCTTCTGACAAACATTATTGGCATACTATTAG 
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CTOCCATOCTAOGGATAATXOOACTATAATTAGTTTAAATAACTATTATAATCTAACAAT 
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CAAATCTACAAGACCAOCAAATAAOACAOTTTTACCAGTCACCATTATGTCTCGATTGGT 
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COMPARISON  OF  THE  GENOMIC  ORGANIZATION  OF 
THE  STLV-III  AND  HIV-2rod  PROVIRUSES 
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ATrAGGTTArGGCCCGGGGGAAAGAAAAAGTACAAGrrAAAAGATATTGTGTGGGCAGCG 
:  R  L  ?  F  ;;  G  K  K  K  y  K  I  f-  H  I  V  W  A  A 

690  700 

aatgaattggacagattcggattagcagagaccctgttggagtcaaaacaagcttgccaa 
SELDPFGLAESLlEStCECCC 

7:0  780 

AAAATTOTTACA  jTTTTAGATCCATTAGTA'TCGACAGOTTCAGAAAXTTTAAAAAGCCTT 
KI  LT'.’LDPwVPTGSESLKSL 

810  840 

TTTAATACTGT-TTGCGTCATCTGGTGCATACACGCACAAGAGAAAGTGAAAGATACTGAA 
F  N  7  C  :  w  c  I  H  A  E  E  K  V  K  D  T  E 
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GGAGCAAAACAAATAOTACAGAGACATCTAGTGGCAGAAACAGGAACTGCAGAGAAAATG 
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7:AAATA:aaGTA’jA77AACAGCACCACCTAGCGGGAAAAACTTCCCCGTGCAGCAAGTA 
?N7S.-  FTAFPSGKSFPVOgv 
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G:TGCCAA:TATA::-7A7ATACCACTGAGTCCC0GAAC7CTAAATGCTTGGGTAAAATTA 
AGJiYTHlPLSPGTLNAWVKL 

lOSO  1080 

GTGGA" GAAAAGAAArrCGGGGCAGAAGTAGTGCCAGCATTTCAGGCACTCTCAGAAGOC 
VEF.  F  r  FGAEVVPGFvALSEG 
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GACCATCAAGCACCTATC 
0  H  5  A  A  « 


nvo  1000 

-AAA7AA7'':a-.GGAAA7TAT7AA70AGCAAGCAGCAGAC7GGGA7GT7GCACA7CCCATA 

;  :  :  ?  E  :  :  s  f  e  a  a  :  -  g  v  a  k  f*  i 


10)0 

GCAGCTTAGAGAGCCAAGA 
g  p  E  p  F 


1060 

G7CTCACATAGCAGG0ACA 
G  D  ;  A  G  7 


1090  noo 

-i.  i  ;7A',AA  UG-:AGA7'*’*AGTGr,ATG7rTA0A::A.:AAAATCCTGTACCAGTA 

7  .  T  y  :  0  w  «  r  r  r  c  N  p  V  p  V 

1  *  .-c  ^  ^  ^  ^  ^  ^  1  JflO 


1 4-;^ 

-AAAg:7AT'-.7AGA7 


14  7,; 

A  '» A  A'.  AAA  . 


18)0  1860 
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A.ACAOOAGCCGCACAGAG/kATArrxAATTTAAA&CACCACCCACACCCTCA-;ACCCACAJk 

nbsbteniafkapgrcs-^e 


CGACACACATCTTOGCCTTGCCCGATACCATATATACATTTCCTCATCCGCCTOCreATT 

GDRSwpwPIAYIHFLIRLLX 


72<j0  ’JiO 

CTAACATACATGTGCACTAACTGCAGACCAGAATCTCTCTACTCCAACATGACrTGGTTC 

VTYMWTNCRCCSLyCHKTt-'r 


84)0  8480 

CCCCTCTrGACCOCCCTATACAACATCTCCAOGGACTTACTATCCAGGATCTCCCCGATC 
RLLTCLYNICRDLLSRISPI 


7)^0 

CTCAATTCGCTACAGAACAGGACGOCTCAGAAACAGCCCAATTATGCACCGTGCCCTATA 

LMWVENRTCgAQRNYAPCRl 


8490  8S20 

CTCCAACCAATCTTCCACAGTCTCCAGAGAGCACTAACAGCAATCACAGACTOCCTCAGC 
LOPirgSLORALTAlRDWLR 


7410  ’<<0 

AGGCAAATAATTAATACCTGGCACAGGCTAGGGAAAAATCTATATTTGCCTCCCA&CCAA 

RQl  iHTWHfiVCRNLYLP?8E 
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ccccagttgacctgcaactcaacggtgaccaccataattcccaacattcatccgcgagat 
GELTCHSTVTSt  lANI  QACO 
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CAAACAAATATTACCTTTACTGCAGAOGCCCCAGAACTATACCCATTOGAATTiOGCCAT 

OTRlTrSAEAAELYRLE-CO 
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tacaaattactagaaatcacaccaattggcttcocacctacatcactaaagacatactcc 
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CCAATCCCCACGGOCATACTTOCACTCCCACCAAOCATCAOGCACOGAOCAGAACTCGCC 
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CTCCTCTCACCCArACCCATATCACCACCCACAmTCTAAATACCCCATGGAGAACCCC 
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ACCACCACAAACCCACAAACAATTCTACAAACACCAAAATATGGATCATCTAGArCTAGA 
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ATCTTOCGACTGACCCTCTCGOGAACAAAAAATCTCCAGOCAAGACTCACTGCtATACAC 

HLRL.TVWGTlCHLgARVTAIC 
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TTATA^mrAGACAAGACATACAATCTTACACTTATACTTACACAJlCCCTTTTArTCTCTA 
YSORRHRlLDLYLDRArTLY 
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AACTACCTAAAGGACCAGCCGCACCTAAATTCATOOOGATCTCCGTTTAGACAAGTCTCC 

RYtROO^^Ot-NSWCCArSOVC 
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CCCACACCAATTTCGCCATAATTCACCACTCCCACACAAACACTCGAXCCCGASArrCAA 
fCErCHMSCLPEREWKARLK 
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CACACTTCTCrrACCATXlGCTAAATGATACCTTGACACCTCATTCGAACAATArCACCTCC 

HTSVPWVROTLTPOWHKJtTW 
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AGCAAOCCCAATACCATTTACTTAAAAACAOCAACACCTATACTTOCTCACCCCACCAAC 
ARCl  PFS^rORF 
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CAGCAATGGGXACAAAAAGTCCGCTACCTOGAOGCAAATATCACTCAAACTCTAGA^AA 

QewegAVRYLCARtsgsLcg 
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TAGCTACTCACAACAGCTCAGACTCCAOOCACTTTCCACAAOOOOCTCTAACCACCAjXc 
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CCACAXATTCAOCAACAAAACAATATCTATCAOCrACAAAAATTAAATAGCTiGGATCTr 
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Fig.  11  The  com  pletc  nucleotide  sequence  of  the  HIV-2M-fn_7  viru: 


The  D  N  A  sequence  has  been  organized  and  presented  as 
the  U  ■)  -  R  -  g  a  g  -  p  o  1  -  m  i  d  d  1  c  r  e  g  i  o  n  -  e  n  v  -  3  '  o  r  f  -  U  3  -  R  .  T  ti  e 
translation  in  amino  acids  of  all  major  open  reading 
fra  me  s  is  p  r  i-  s  e  n  t  e  d  u  nd  c  r  n  e  a  t  h  the  n  u  r  1  e  o  r  i  d  ('  s  e  q  ii  e  n  c  e 
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12  C  o  m  n  Q  r : G  o  n  of  r  h  o  major  c  ore  proteins  of  HIV  1  , 
il  I  V  -  2  TflTv^Tri  ACM  .  EIAV  and  VISNA 
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Fig.  13  Identification  of  variable  and  conserved  regions  in 
the  envelope  proteins  of  HIV-1  and  HIV-2. 
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s  the  extracellular  envelope 
portion  contains  the  trans- 
e  n  V  e  1  o  I'l  e  proteins.  The 
he  HIV-2^-]H_y  envelope  gene 
d  i  r  a  t  i  n  g,  a  p  e  r  f  e  c  t  match 

h  i  1  e  t  h  e  <!  o  t  L  e  d  1  i  n  e 

h  e  ami  n  o  a  rid.  T  h  e  boxes 
V  e  lope  con  s  i>  r  v  e  d  re  g  ions) 

i  h  i  t  only  <'  o  use  r  v  a  t.  i  v  e 

respect  to  the  H  T  I .  V  -  I  1  I  li 

t  t  «■  d  o  V  a  Is  i  n  d  i  r  a  t  e 

c'  r  <■  s  i  d  11  <'  s  i  n  h  n  t  h  HIV- 

d  H  1  V  -1  (  MTI.l'  -  I  I  I  II  )  .  Tile 

t  e  i  II r  c  s  i  d.  e  '  ■  s  'hat  ate 
s  t  A  !  I  i  c  a  !]  i  r  a  1  isolate  s 

c  .1  t  i  o  r,  o  t  t  !i  '•  tor  m  i  n  a  t  i  o  ii 


:;ti.v  1  I  I 


gpl20(130) 


Termination 

Codon 

I  COOH 

^7aa  I  144aa  |  STLV-IIIagm  ENVELOPE 
gp32? 


HIV-2rod 


HIV-I  (HTLV-IIIB) 


S  c  h  o  mn  t  i  c _ r  e  p  r  o  ;;  <•  ii  t  ,i  t  i  <i  r  •  > ; 

of  STL  V -  III  n  II  <i  p  !•  r  1  '  ■  II  f  ' 

V  a  t  i  o  n  in  t  h  <  •  n  t  ti  •  •  r  v  i  t  i  !  i  ■  •  ■ 

after  the  s  t  o  [i  <-  o  <1  <>  n  . 
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A  n  r  h  (‘  iTi.'i  I  i  <■  r  >•  p  r  ii  i  ■ 

r  e  p  i  o  n  o  !  l  li  •  ■  o  n  v  <  •  1  .  i  p  . 

t  h  r  o  V  (■  r  a  11  p  e  r  r  '■  ii  l  .i 

a  11  d  both  II  i  -  ?  a  p.  d  HI 

the  lino,  t  li  h  '  w:p  ,  1  , 1  v 
r.i  p  m  1)  r  a  n  p  p  r  o  t  p  i  n  and  i 

p  r  o  t  p  i  n  is  r  p  ;i  i  c  •  ■ ,  .  ■  i ;  •  .  ■  d 


'  '  I  a  It  n  p  ni  li  I  a  n  p 

p  r  p  ■■  p  II  I  p  d  and 

I  n  i  >  n  1'  S  T  1,  V  '■  1  I  I  1.^ 

:  p  .1  .  r  n  n  p  a  I  h 

I'll  ' '  1  I  a  1  frails 
i  I  ;  >.<  f  I  I  I ;  s  n  p  n  1 1  t  a  n  p 

P  a  I  '  p  r  f  P  .■  s  t  p  p 
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Amino  acid 

DEL 

E  T  I  0  N 

S 

R  E  P  L  I 

C  A  T  I 

0  N  & 

changes  in 
env  (gp41) 

TRANS 

MISS 

I  0  N 

Replication 

Transmission 

Construct 

tat/trs 

LTR/PPT 

3'orf 

(RT/Cos-l) 

Molt3 

H9 

LR  329  ' 

-0 

+47 

+ 

+++++ 

+++++ 

+++ 

LR  206 

-4 

+59 

+ 

+++++ 

(t) 

+++ 

XlO-1 

-5 

+  15 

+ 

+++++ 

+++++ 

+++++ 

X9-3 

-5 

+>70 

+ 

+++++ 

(t) 

(t) 

LR  369* 

-6 

+2 

+ 

not  tested 

++ 

+++ 

LR  295 

-14 

+2 

+ 

+++++ 

++ 

+++++ 

LR  372* 

-15 

+4 

+ 

not  tested 

(t) 

+++ 

LR  429* 

-15 

+4 

+ 

not  tested 

(t) 

+++ 

LR  269 

-17 

+2 

+ 

++++ 

(t) 

not  tested 

LR  306 

-22 

+2 

+ 

not  tested 

not  tested 

LR  319 

-30 

+ 

+ 

not  tested 

- 

not  tested 

LR  468* 

-33 

+0 

+ 

+++ 

(t) 

+++ 

LR  362 

-37 

+  18 

+ 

+++ 

- 

not  tested 

LR  358 

-6 

+3 

+ 

+ 

++ 

(t) 

LR  318 

-6 

+24 

+ 

+ 

+++++ 

(t) 

not  tested 

LR  204 

-6 

+27 

+ 

+ 

+++++ 

(t) 

not  tested 

LR  312 

-14 

+74 

+ 

+ 

+++++ 

II 

LR  192 

-17 

+11 

+ 

+ 

+++++ 

- 

II 

1 

LR  189 

-41 

+0 

+ 

+ 

+++++ 

- 

II 

LR  274 

-42 

+2 

+ 

+ 

+++++ 

II 

LR  194 

-76 

+9 

+ 

+ 

+++++ 

- 

II 

LR  360 

-87 

+32 

+ 

+ 

+ 

not  tested 

LR  327 

-117+20 

+ 

+ 

+ 

- 

- 

II 

LR  230 

none 

? 

+ 

+ 

+++ 

not  tested 

LR  330 

none 

+ 

+++++ 

+++++ 

II 

KEY  **  Denotes  the  presence  of  linkers  in  the  plasmid  construct 

'Replication'  was  assessed  as  the  production  of  reverse  transcriptase  in 
supernatants  taken  from  cos-1  cells,  6-7  days  after  transfection.  'Transmission' 
was  judged  by  the  rise  in  HIV-1  positive  cells  among  cultures  of  either  H9  or  Molt3 
cells,  after  coculture  with  transfected  cos-1  cells,  't'  denotes  transient  HIV-1 
expression  in  cultures  where  no  infected  cells  could  be  detected  by  week  4. 

+  <.01%  expression  by  week  4 

++  >.01-<1%  expression  by  week  4 
+++  1-10%  expression  by  week  4 
++++/++++++  50-90  %  expression  by  week  4 


TABLE  2;  Sunnnary  of  Biological  Properties  of  Deletion  Mutants  In  3'orf. 
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CAT  ASSAY  RESULTS  IN  T-CELLS  (H9)  AND  HTLV-III  INFECTED  T-CELLS  (H9/III) 


Plasmid 

SVO 

SV2 

RSV 

C15CAT 

CD12CAT 

VHHCAT 

-65 

-48 

-65E2 

-65E5 

-48E9 

-48E14 

-48E8 


Description 


20  min.  assay 
H9/III  Exp.  2 


15  hr.  assay 
H9  Exp.  2 


15  hr.  assay 
H9  Exp.  3 


Negative  Control 

0.0 

0.0 

0.0 

Positive  Control 

2.6** 

6.7 

±  1.0 

4.6 

*  1.5 

It 

18.8 

+ 

4.8 

48.4 

±  3.41 

55.8 

t  3.1 

HTLV  III  LTR-CAT 

20.8** 

0.1 

±  0.04 

ND 

II 

88.8  J 

h 

10.9 

0.7 

±  0.05 

0.70 

±  0.080 

II 

92.7 

+ 

1.3 

ND 

2.4 

±  0.34 

del . 

-65  from  CAP 

32.1** 

0.09 

t  0.03 

0.0 

del . 

-48  from  CAP 

0.0 

ND 

0.0 

5'-3' 

-105  to  -80 

99.1 

0.1 

MD 

2.7 

±  0.53 

3‘-5' 

-80  to  -104 

99.2 

0.1 

ND 

0.85 

*  0.31 

5'-3' 

-105  to  -80 

26.7** 

ND 

0.21 

t  0.061 

5'-3' 

-105  to  -79 

22.4** 

ND 

0.18 

*  0.006 

3'-5' 

-80  to  -104 

53.8 

7.9 

ND 

0.27 

i  0.093 

del 

.  -117  from  CAP 

97.5 

+ 

0.8 

0.4 

t  0.07 

0.66 

O 

41 

BS  del 

.  Bgl II  to  SstI 

0.13 

0.02 

0.2 

±  0.09 

ND 

S  del 

.  4  bp  at  SstI 

1.2 

0.3 

16.7*’ 

ND 

Table  3.  Percent  conversion  of  ^^C-chloramphenicol  to  acetylated  metabolites  using 
lysates  of  cells  transiently  transfected  with  the  indicated  plasmids.  Cells  used 
were  H9,  an  uninfected  T  cell  line  and  H9/III  cells,  the  same  cell  line  productively 
infected  with  HTLV  III.  Transfections  were  done  in  triplicate  for  values  showing 
standard  deviations.  Values  for  single  and  averages  for  duplicate  transfections  are 
indicated  by  one  or  two  asterisks  respectively.  ND  means  not  done. 
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